SEQUENCE LISTING 

(1) GENERAL INFORMATION: — 

(i) APPLICANT: 111, Charles R. et al. 

(ii) TITLE OF INVENTION: NOVEL VECTORS AND GENES EXHIBITING 

INCREASED EXPRESSION 

(iii) NUMBER OF SEQUENCES: 11 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: LAHIVE & COCKFIELD, LLP 

(B) STREET: 28 STATE STREET 

(C) CITY: BOSTON 

(D) STATE: MASSACHUSETTS 

(E) COUNTRY: US 

(F) ZIP: 02109 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
'(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 04 DECEMBER 1998 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: . ^ 

(A) APPLICATION NUMBER: US 60/067,614 

(B) FILING DATE: 05 DECEMBER 1997 

(A) APPLICATION NUMBER: US 60/071,596 

(B) FILING DATE: 16 JANUARY 1998 



(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: REMILLARD, JANE E . 

(B) REGISTRATION NUMBER: 38,872 

(C) REFERENCE /DOCKET NUMBER: TTI-180 



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617)227-74 00 

(B) TELEFAX: (617)742-4214 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4374 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: L. .4374 



1 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

ATG GAA ATA GAG CTC TCC ACC TGC TTC TTT CTG TGC CTT TTG CGA TTC 4 8 

Met Glu lie Glu Leu Ser Thr Cys Phe Phe Leu Cys Leu Leu Arg Phe 
1*5 10 15 

TGC TTT AGT GCC ACC AGA AGA TAC TAC CTG GGT GCA GTG GAA CTG TCA 96 
Cys Phe Ser Ala Thr Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu Ser 
20 25 30 

TGG GAC TAT ATG CAA AGT GAT CTC GGA GAG CTG CCT GTG GAC GCA AGA 14 4 

Trp Asp Tyr Met. Gin Ser Asp Leu Gly Glu Leu Pro Val Asp Ala Arg 
35 40 45 

TTT CCT CCT CGC GTG CCA AAA TCT TTT CCA TTC AAC ACC TCA GTC GTG 192 
Phe Pro Pro Arg Val Pro Lys Ser Phe Pro Phe Asn Thr Ser Val Val 
50 55 60 

TAC AAA AAG ACT CTG TTT GTA GAA TTC ACG GTT CAC CTT TTC AAC ATC 24 0 

Tyr Lys Lys Thr Leu Phe Val Glu Phe Thr Val His Leu Phe Asn lie 
65 70 75 80 

GCT AAG CCA AGG CCA CCC TGG ATG GGT CTG CTA GGT CCT ACC ATC CAA 28 8 

Ala Lys Pro Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr lie Gin 
85 90 95 

GCT GAG GTT TAT GAT ACA GTG GTC ATT ACA CTT AAG AAC ATG GCT TCC 336 
Ala Glu Val Tyr Asp Thr Val Val lie Thr Leu Lys Asn Met Ala Ser 
100 105 110 

CAT CCT GTC TCC CTT CAT GCT GTT GGT GTA TCC TAC TGG AAA GCT TCT 38 4 

His Pro Val Ser Leu His Ala Val Gly Val Ser Tyr Trp Lys Ala Ser 
115 120 125 

GAG GGA GCT GAA TAT GAT GAT CAG ACC AGT CAA AGG GAG AAA GAA GAT 4 32 

Glu Gly Ala Glu Tyr Asp Asp Gin Thr Ser Gin Arg Glu Lys Glu Asp 
130 135 140 

GAT AAA GTC TTC CCT GGT GGA AGC CAT ACA TAT GTC TGG CAA GTC CTG 4 80 

Asp Lys Val Phe Pro Gly Gly Ser His Thr Tyr Val Trp Gin Val Leu 
145 150 155 160 

AAA GAG AAT GGT CCA ATG GCC TCC GAC CCA CTG TGC CTT ACC TAC TCA 528 
Lys Glu Asn Gly Pro Met Ala Ser Asp Pro Leu Cys Leu Thr Tyr Ser 
165 170 175 

TAT CTT TCT CAT GTG GAC CTG GTT AAA GAC TTG AAT TCA GGC CTC ATT 57 6 

Tyr Leu Ser His Val Asp Leu Val Lys Asp Leu Asn Ser Gly Leu lie 
180 • 185 190 

GGA GCC CTA CTA GTA TGT AGA GAA GGG AGT CTG GCC AAG GAA AAG ACA 624 
Gly Ala Leu Leu Val Cys Arg Glu Gly Ser Leu Ala Lys Glu Lys Thr 
195 200 205 

CAG ACC TTG CAC AAA TTT ATA CTA CTT TTT GCT GTA TTT GAT GAA GGG 67 2 

Gin Thr Leu His Lys Phe lie Leu Leu Phe Ala Val Phe Asp Glu Gly 
210 215 220 
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AAA AGT TGG CAC TCA GAA ACA AAG AAC TCC CTC ATG CAA GAT AGG GAT 720 
Lys Ser Trp His Ser Glu Thr Lys Asn Ser Leu Met Gin Asp Arg Asp 
225 230 235 240 

GCT GCA TCT GCT CGG GCC TGG CCT AAA ATG CAC ACA GTC AAT GGT TAT 7 68 

Ala Ala Ser Ala Arg Ala Trp Pro Lys Met His Thr Val Asn Gly Tyr 
245 250 255 

GTA AAC AGG AGC CTG CCA GGA CTG ATT GGA TGC CAC AGG AAA TCA GTC 816 
Val Asn Arg Ser Leu Pro Gly Leu lie Gly Cys His Arg Lys Ser Val 
260 265 270 

TAT TGG CAT GTT ATA GGA ATG GGC ACC ACT CCT GAA GTG CAC TCA ATA 8 64 

Tyr Trp His Val lie Gly Met Gly Thr Thr Pro Glu Val His Ser lie 
275 280 285 

TTC CTC GAA GGA CAC ACA TTT CTT GTT AGA AAC CAT CGC CAG GCG TCC 912 
Phe Leu Glu Gly His Thr Phe Leu Val Arg Asn His Arg Gin Ala Ser 
290 295 300 

TTG GAA ATC TCG CCA ATA ACT TTC CTT ACT GCT CAA ACA CTC CTC ATG 960 
Leu Glu lie Ser Pro lie Thr Phe Leu Thr Ala Gin Thr Leu Leu Met 
305 310 315 320 

GAC CTT GGA CAG TTT CTA CTG TTT TGT CAT ATC TCT TCC CAC CAA CAT 1008 
Asp Leu Gly Gin Phe Leu Leu Phe Cys His lie Ser Ser His Gin His 
325 330 335 

GAT GGC ATG GAA GCT TAT GTC AAA GTA GAC AGC TGT CCA GAG GAA CCC 1056 
Asp Gly Met Glu Ala Tyr Val Lys Val Asp Ser Cys Pro Glu Glu Pro 
340 345 350 

CAA CTA CGA ATG AAA AAT AAT GAA GAA GCG GAA GAC TAT GAT GAT GAT 1104 
Gin Leu Arg Met Lys Asn Asn Glu Glu Ala Glu Asp Tyr Asp Asp Asp 
355 360 365 

CTT ACC GAT TCT GAA ATG GAT GTG GTC AGA TTT GAT GAT GAC AAC TCT 1152 
Leu Thr Asp Ser Glu Met Asp Val Val Arg Phe Asp Asp Asp Asn Ser 
370 375 "380 

CCT TCC TTT ATC CAA ATT CGC TCA GTT GCC AAG AAG CAT CCT AAA ACT 1200 
Pro Ser Phe lie Gin lie Arg Ser Val Ala Lys Lys His Pro Lys Thr 
385 390 395 400 

TGG GTA CAT TAC ATT GCT GCT GAA GAG GAG GAC TGG GAC TAT GCT CCC 12 4 8 

Trp Val His Tyr lie Ala Ala Glu Glu Glu Asp Trp Asp Tyr Ala Pro 
405 410 415 

TTA GTC CTC GCC CCC GAT GAC AGA AGT TAT AAA AGT CAA TAT TTG AAC 12 96 

Leu Val Leu Ala Pro Asp Asp Arg Ser Tyr Lys Ser Gin Tyr Leu Asn 
420 425 430 

AAT GGC CCT CAG CGG ATT GGA AGG AAG TAC AAA AAA GTC CGA TTT ATG 134 4 

Asn Gly Pro Gin Arg lie Gly Arg Lys Tyr Lys Lys Val Arg Phe Met 
435 440 445 
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GCA TAC ACA GAT GAA ACC TTT AAG ACT CGT GAA GCT ATT CAG CAT GAA 1392 
Ala Tyr Thr Asp Glu Thr Phe Lys Thr Arg Glu Ala lie Gin His Glu 
450 455 460 

TCA GGA ATC TTG GGA CCT TTA CTT TAT GGG GAA GTT GGA GAC ACA CTG 14 4 0 

Ser Gly lie Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Leu 
465 470 475 480 

CTC ATT ATA TTT AAG AAT CAA GCA AGC AGA CCA TAT AAC ATC TAC CCT 14 88 

Leu lie lie Phe Lys Asn Gin Ala Ser Arg Pro Tyr Asn lie Tyr Pro 
485 490 495 

CAC GGA ATC ACC GAT GTC CGT CCT TTG TAT TCA CGC AGA TTA CCA AAA 1536 
His Gly lie Thr Asp Val Arg Pro Leu Tyr Ser Arg Arg Leu Pro Lys 
500 505 510 

GGA GTA AAA CAT TTG AAG GAT TTT CCA ATT CTG CCC GGA GAA ATA TTC 158 4 

Gly Val Lys His Leu Lys Asp Phe Pro lie Leu Pro Gly Glu lie Phe 
515 520 525 

AAA TAT AAA TGG ACA GTG ACT GTA GAA GAT GGG CCA ACT AAA TCA GAT 1632 
Lys Tyr Lys Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser Asp 
530 535 540 

CCT CGG TGC CTG ACC CGC TAT TAC TCT AGT TTC GTC AAT ATG GAG AGA 168 0 

Pro Arg Cys Leu Thr Arg Tyr Tyr Ser Ser Phe Val Asn Met Glu Arg 
545 550 555 560 

GAT CTA GCT TCA GGA CTC ATT GGC CCT CTC CTC ATC TGC TAC AAA GAA 17 28 

Asp Leu Ala Ser Gly Leu lie Gly Pro Leu Leu lie Cys Tyr Lys Glu 
565 570 575 

TCT GTA GAT CAA AGA GGA AAC CAG ATA ATG TCA GAC AAG AGG AAT GTC 17 7 6 

Ser Val Asp Gin Arg Gly Asn Gin He Met Ser Asp Lys Arg Asn Val 
580 585 590 

ATC CTG TTT TCT GTA TTT GAT GAG AAC CGA AGC TGG TAC CTC ACA GAG 182 4 

He Leu Phe Ser Val Phe Asp Glu Asn Arg Ser Trp Tyr Leu Thr Glu 
595 600 605 

AAT ATA CAA CGC TTT CTC CCC AAT CCC GCT GGA GTG CAG CTT GAG GAT 1872 
Asn He Gin Arg Phe Leu Pro Asn Pro Ala Gly Val Gin Leu Glu Asp 
610 615 620 

CCA GAG TTC CAA GCC TCC AAC ATC ATG CAC AGC ATC AAT GGC TAT GTT 1920 
Pro Glu Phe Gin Ala Ser Asn He Met His Ser He Asn Gly Tyr Val 
625 630 635 640 

TTC GAT AGT TTG CAG TTG TCA GTT TGT TTG CAT GAA GTA GCA TAC TGG 1968 
Phe Asp Ser Leu Gin Leu Ser Val Cys Leu His Glu Val Ala Tyr Trp 
645 650 655 

TAC ATT CTA AGC ATT GGA GCA CAG ACT GAC TTC CTT TCT GTC TTC TTC 2016 
Tyr He Leu Ser He Gly Ala Gin Thr Asp Phe Leu Ser Val Phe Phe 
660 665 670 
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TCT GGA TAT ACC TTC AAA CAC AAA ATG GTC TAT GAA GAC ACA CTC ACC 2064 
Ser Gly Tyr Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu Thr 
675 680 685 

CTA TTC CCA TTC TCC GGA GAA ACT GTC TTC ATG TCG ATG GAA AAC CCA 2112 
Leu Phe Pro Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn Pro 
690 695 700 

GGA CTA TGG ATT CTG GGG TGC CAC AAC TCA GAC TTT CGG AAC AGA GGC 2160 
Gly Leu Trp lie Leu Gly Cys His Asn Ser Asp Phe Arg. Asn Arg Gly 
705 710 715 720 

ATG ACC GCC TTA CTG AAA GTT TCC AGT TGT GAC AAG AAC ACT GGA GAT 2208 
Met Thr Ala Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr Gly Asp 
725 730 735 

TAT TAC GAG GAC AGT TAT GAA GAT ATT TCA GCA TAC TTG CTG AGT AAA 22 5 6 

Tyr Tyr Glu Asp Ser Tyr Glu Asp lie Ser Ala Tyr Leu Leu Ser Lys 
740 745 750 

AAC AAT GCC ATT GAA CCA AGA AGC TTC TCC CAG AAC CCA CCA GTC TTG 2 304 

Asn Asn Ala lie Glu Pro Arg Ser Phe Ser Gin Asn Pro Pro Val Leu 
755 760 765 

AAA CGC CAT CAA CGG GAA ATA ACT CGT ACT ACT CTT CAA TCA GAT CAA 2 352 

Lys Arg His Gin Arg Glu lie Thr Arg Thr Thr Leu Gin Ser Asp Gin 
770 775 780 

GAG GAA ATT GAC TAT GAT GAT ACC ATA TCA GTT GAA ATG AAG AAG GAA 2 4 00 

Glu Glu lie Asp Tyr Asp Asp Thr lie Ser Val Glu Met Lys Lys Glu 
785 790 795 800 

GAT TTC GAC ATT TAT GAT GAG GAT GAA AAT CAG AGC CCC CGC AGC TTT 24 4 8 

Asp Phe Asp lie Tyr Asp Glu Asp Glu Asn Gin Ser Pro Arg Ser Phe 
805 810 815 

CAA AAG AAA ACA CGA CAC TAT TTT ATT GCT GCA GTG GAG AGG CTC TGG 2 4 96 

Gin Lys Lys Thr Arg His Tyr Phe lie Ala Ala Val Glu Arg Leu Trp 
820 825 830 

GAT TAT GGG ATG AGT AGC TCC CCA CAT GTT CTA AGA AAC AGG GCT CAG 254 4 

Asp Tyr Gly Met Ser Ser Ser Pro His Val Leu Arg Asn Arg Ala Gin 
835 840 845 

AGT GGC AGT GTC CCT CAG TTC AAG AAA GTA GTA TTC CAG GAA TTT ACC 25 92 

Ser Gly Ser Val Pro Gin Phe Lys Lys Val Val Phe Gin Glu Phe Thr 
850 855 860 

GAT GGC TCC TTT ACT CAA CCC TTA TAC CGT GGA GAA CTA AAT GAA CAT 2 64 0 

Asp Gly Ser Phe Thr Gin Pro Leu Tyr Arg Gly Glu Leu Asn Glu His 
865 870 875 880 

TTG GGA CTC CTG GGG CCA TAT ATA AGA GCA GAA GTT GAA GAT AAT ATC 2688 
Leu Gly Leu Leu Gly Pro Tyr lie Arg Ala Glu Val Glu Asp Asn lie 
885 890 895 
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ATG GTT ACC TTC AGA AAT CAG GCC TCT CGT CCC TAT TCC TTC TAT TCT 27 3 6 

Met Val Thr Phe Arg Asn Gin Ala Ser Arg Pro Tyr Ser Phe Tyr Ser 
900 905 910 

TCC CTC ATA TCA TAT GAG GAA GAT CAG AGG CAA GGA GCA GAA CCT AGA 278 4 

Ser Leu lie Ser Tyr Glu Glu Asp Gin Arg Gin Gly Ala Glu Pro Arg 
915 920 925 

AAA AAC TTT GTC AAG CCT AAT GAA ACC AAA ACT TAC TTT TGG AAA GTG 28 32 

Lys Asn Phe Val Lys Pro Asn Glu Thr Lys Thr Tyr Phe. Trp Lys Val 
930 935 940 

CAA CAT CAT ATG GCA CCC ACT AAA GAT GAG TTT GAC TGC AAA GCC TGG 28 8 0 

Gin His His Met Ala Pro Thr Lys Asp Glu Phe Asp Cys Lys Ala Trp 
945 950 955 960 

GCT TAT TTC TCC GAT GTC GAC CTG GAA AAA GAT GTG CAC TCA GGC CTG 2 928 

Ala Tyr Phe Ser Asp Val Asp Leu Glu Lys Asp Val His Ser Gly Leu 
965 970 975 

ATT GGA CCC CTT CTG GTC TGC CAC ACC AAC ACA CTG AAC CCT GCT CAT 2 97 6 

He Gly Pro Leu Leu Val Cys His Thr Asn Thr Leu Asn Pro Ala His 
980 985 990 

GGG AGA CAA GTG ACA GTA CAG GAA TTT GCT CTG TTT TTC ACC ATC TTC 3024 
Gly Arg Gin Val Thr Val Gin Glu Phe Ala Leu Phe Phe Thr He Phe 
995 1000 1005 

GAT GAG ACC AAA AGC TGG TAC TTC ACT GAA AAT ATG GAA AGA AAC TGC 307 2 

Asp Glu Thr Lys Ser Trp Tyr Phe Thr Glu Asn Met Glu Arg Asn Cys 
1010 1015 1020 

AGG GCT CCC TGC AAT ATC CAG ATG GAA GAT CCC ACT TTT AAA GAG AAT 312 0 

Arg Ala Pro Cys Asn He Gin Met Glu Asp Pro Thr Phe Lys Glu Asn 
1025 1030 1035 1040 

TAT CGC TTC CAT GCA ATC AAT GGC TAC ATA ATG GAT ACA CTA CCT GGC 3168 
Tyr Arg Phe His Ala He Asn Gly Tyr He Met Asp Thr Leu Pro Gly 
1045 1050 1055 

TTA GTA ATG GCT CAG GAT CAA AGG ATT CGA TGG TAT CTG CTC AGC ATG 3216 
Leu Val Met Ala Gin Asp Gin Arg lie Arg Trp Tyr Leu Leu Ser Met 
1060 1065 1070 

GGC AGC AAT GAA AAC ATC CAT TCT ATT CAT TTC TCC GGA CAT GTG TTC 32 64 

Gly Ser Asn Glu Asn He His Ser He His Phe Ser Gly His Val Phe 
1075 1080 1085 

ACT GTA CGA AAA AAA GAG GAG TAT AAA ATG GCA CTG TAC AAT CTC TAT 3312 
Thr Val Arg Lys Lys Glu Glu Tyr Lys Met Ala Leu Tyr Asn Leu Tyr 
1090 1095 1100 

CCC GGA GTT TTC GAG ACA GTG GAA ATG TTA CCA TCC AAA GCT GGA ATT 3360 
Pro Gly Val Phe Glu Thr Val Glu Met Leu Pro Ser Lys Ala Gly He 
1105 1110 1115 1120 
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TGG CGG GTG GAA TGC CTT ATT GGC GAG CAT CTA CAT GCT GGG ATG AGC 34 08 

Trp Arg Val Glu Cys Leu lie Gly Glu His Leu His Ala Gly Met Ser 
1125 1130 1135 

ACA CTT TTT CTG GTG TAG TCC AAT AAG TGT CAG ACT CCC CTG GGA ATG 34 5 6 

Thr Leu Phe Leu Val Tyr Ser Asn Lys Cys Gin Thr Pro Leu Gly Met 
1140 1145 1150 

GCT TCT GGA CAC ATT AGA GAT TTT CAG ATT ACA GCT TCA GGA CAA TAT 3504 
Ala Ser Gly His lie Arg Asp Phe Gin lie Thr Ala Ser Gly Gin Tyr 
1155 1160 1165 

GGA CAG TGG GCC CCA AAG CTG GCC AGA CTT CAT TAT TCC GGA TCA ATC 3552 
Gly Gin Trp Ala. Pro Lys Leu Ala Arg Leu His Tyr Ser Gly Ser lie 
1170 1175 1180 

AAT GCC TGG AGC ACC AAG GAG CCC TTT TCT TGG ATC AAA GTT GAC CTG 3600 
Asn Ala Trp Ser Thr Lys Glu Pro Phe Ser Trp lie Lys Val Asp Leu 
1185 1190 1195 1200 

TTG GCA CCA ATG ATT ATT CAC GGC ATC AAG ACC CAG GGT GCC CGT CAG 364 8 

Leu Ala Pro Met lie lie His Gly lie Lys Thr Gin Gly Ala Arg Gin 
1205 1210 1215 

AAG TTC TCC AGC CTC TAC ATC TCT CAA TTT ATC ATC ATG TAT AGT CTC 3696 
Lys Phe Ser Ser Leu Tyr lie Ser Gin Phe lie lie Met Tyr Ser Leu 
1220 1225 1230 

GAT GGG AAG AAG TGG CAG ACT TAT CGA GGA AAT TCC ACT GGA ACC CTC 37 4 4 

Asp Gly Lys Lys Trp Gin Thr Tyr Arg Gly Asn Ser Thr Gly Thr Leu 
1235 1240 1245 

ATG GTC TTC TTT GGC AAT GTG GAT TCA TCT GGG ATA AAA CAC AAT ATT 37 92 

Met Val Phe Phe Gly Asn Val Asp Ser Ser Gly lie Lys His Asn lie 
1250 1255 1260 

TTC AAC CCT CCA ATT ATT GCT CGA TAC ATC CGT TTG CAC CCA ACT CAT 38 4 0 

Phe Asn Pro Pro lie lie Ala Arg Tyr lie Arg Leu His Pro Thr His 
1265 1270 1275 1280 

TAT AGC ATT CGC AGC ACT CTT CGC ATG GAG TTG ATG GGC TGT GAT TTA 38 8 8 
Tyr Ser lie Arg Ser Thr Leu Arg Met Glu Leu Met Gly Cys Asp Leu 
1285 1290 1295 

AAT AGT TGC AGC ATG CCA TTG GGA ATG GAG AGT AAA GCA ATA TCA GAT 3936 
Asn Ser Cys Ser Met Pro Leu Gly Met Glu Ser Lys Ala lie Ser Asp 
1300 1305 1310 

GCA CAG ATT ACT GCT TCA TCC TAC TTT ACC AAT ATG TTT GCC ACC TGG 398 4 

Ala Gin lie Thr Ala Ser Ser Tyr Phe Thr Asn Met Phe Ala Thr Trp 
1315 1320 1325 

TCT CCT TCA AAA GCT CGA CTA CAC CTA CAA GGG AGG AGT AAT GCC TGG 4 032 

Ser Pro Ser Lys Ala Arg Leu His Leu Gin Gly Arg Ser Asn Ala Trp 
1330 1335 1340 
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AGA CCT CAA GTT AAC AAT CCA AAA GAG TGG CTG CAA GTG GAC TTC CAG 4 080 

Arg Pro Gin Val Asn Asn Pro Lys Glu Trp Leu Gin Val Asp Phe Gin 
1345 1350 1355 1360 

AAG ACA ATG AAA GTC ACA GGA GTA ACT ACT CAG GGA GTA AAA TCT CTG 4128 
Lys Thr Met Lys Val Thr Gly Val Thr Thr Gin Gly Val Lys Ser Leu 
1365 1370 1375 

CTT ACC TCT ATG TAC GTG AAG GAG TTC CTC ATA TCG TCG TCG CAA GAT 4176 
Leu Thr Ser Met Tyr Val Lys Glu Phe Leu lie Ser Ser Ser Gin Asp 
1380 1385 1390 

GGC CAT CAG TGG ACT CTC TTT TTT CAA AAT GGC AAA GTA AAA GTT TTC 4 224 

Gly His Gin Trp Thr Leu Phe Phe Gin Asn Gly Lys Val Lys Val Phe 
1395 1400 1405 

CAG GGA AAT CAA GAC TCC TTC ACA CCT GTC GTG AAC TCT CTA GAC CCA 4 2 72 

Gin Gly Asn Gin Asp Ser Phe Thr Pro Val Val Asn Ser Leu Asp Pro 
1410 1415 1420 

CCG TTA CTC ACT CGC TAC CTT CGA ATT CAC CCC CAG AGT TGG GTG CAC 4 320 

Pro Leu Leu Thr Arg Tyr Leu Arg lie His Pro Gin Ser Trp Val His 
1425 1430 1435 1440 

CAG ATT GCC CTG AGG ATG GAG GTT CTG GGC TGC GAG GCA CAG GAC CTC 4 3 68 

Gin lie Ala Leu Arg Met Glu Val Leu Gly Cys Glu Ala Gin Asp Leu 
1445 1450 1455 

TAC TGA 4 37 4 

Tyr 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9164 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1006.. 5376 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GTCGACGGTA TCGATAAGCT TGATATCGAA TTCCTGCAGC CCGGGGGATC CACTAGTACT 60 

CGAGACCTAG GAGTTAATTT TTAAAAAGCA GTCAAAAGTC CAAGTGGCCC TTGCGAGCAT 120 

TTACTCTCTC TGTTTGCTCT GGTTAATAAT CTCAGGAGCA CAAACATTCC TTACTAGTCC 180 

TAGAAGTTAA TTTTTAAAAA GCAGTCAAAA GTCCAAGTGG CCCTTGCGAG CATTTACTCT 24 0 

CTCTGTTTGC TCTGGTTAAT AATCTCAGGA G C AC AAAC AT TCCTTACTAG TTCTAGAGCG 300 

GCCGCCAGTG TGCTGGAATT CGGCTTTTTT AGGGCTGGAA GCTACCTTTG ACATCATTTC 360 
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CTCTGCGAAT GCATGTATAA TTTCTACAGA ACCTATTAGA AAGGATCACC CAGCCTCTGC 4 20 

TTTTGTACAA CTTTCCCTTA AAAAACTGCC AATTCCACTG CTGTTTGGCC CAATAGTGAG 4 80 

AACTTTTTCC TGCTGCCTCT TGGTGCTTTT GCCTATGGCC CCTATTCTGC CTGCTGAAGA 54 0 

CACTCTTGCC AGCATGGACT TAAACCCCTC CAGCTCTGAC AATCCTCTTT CTCTTTTGTT 600 

TTACATGAAG GGTCTGGCAG CCAAAGCAAT CACTCAAAGT TCAAACCTTA TCATTTTTTG 660 

CTTTGTTCCT CTTGGCCTTG GTTTTGTACA TCAGCTTTGA AAATACCATC CCAGGGTTAA 720 

TGCTGGGGTT AATTTATAAC TAAGAGTGCT CTAGTTTTGC AATACAGGAC ATGCTATAAA 7 80 

AATGGAAAGA TGTTGCTTTC TGAGAGATCT CGAGGAAGCT AACAACAAAG AACAACAAAC 84 0 

AACAATCAGG TAAGTATCCT TTTTACAGCA CAACTTAATG AGACAGATAG AAACTGGTCT 900 

TGTAGAAACA GAGTAGTCGC CTGCTTTTCT GCCAGGTGCT GACTTCTCTC CCCTTCTCTT 960 

TTTTCCTTTT CTCAGGATAA CAAGAACGAA ACAATAACAG CCACC ATG GAA ATA 1014 

Met Glu lie 
1 

GAG CTC TCC ACC 
Glu Leu Ser Thr 
5 

GCC ACC AGA AGA 
Ala Thr Arg Arg 
20 

ATG CAA AGT GAT 
Met Gin Ser Asp 



AGA GTG CCA AAA 
Arg Val Pro Lys 
55 

ACT CTG TTT GTA 
Thr Leu Phe Val 
70 

AGG CCA CCC TGG 
Arg Pro Pro Trp 
85 

TAT GAT AC A GTG 
Tyr Asp Thr Val 
100 

AGT CTT CAT GCT 
Ser Leu His Ala 



TGC TTC TTT CTG TGC CTT TTG CGA TTC TGC TTT AGT 10 62 

Cys Phe Phe Leu Cys Leu Leu Arg Phe Cys Phe Ser 
10 15 

TAC TAC CTG GGT GCA GTG GAA CTG TCA TGG GAC TAT 1110 
Tyr Tyr Leu Gly Ala Val Glu Leu Ser Trp Asp Tyr 
25 30 35 

CTC GGT GAG CTG CCT GTG GAC GCA AGA TTT CCT CCT 1158 
Leu Gly Glu Leu Pro Val Asp Ala Arg Phe Pro Pro 
40 45 50 

TCT TTT CCA TTC AAC ACC TCA GTC GTG TAC AAA AAG 1206 
Ser Phe Pro Phe Asn Thr Ser Val Val Tyr Lys Lys 
60 65 

GAA TTC ACG GTT CAC CTT TTC AAC ATC GCT AAG CCA 12 54 
Glu Phe Thr Val His Leu Phe Asn He Ala Lys Pro 
75 80 

ATG GGT CTG CTA GGT CCT ACC ATC CAG GCT GAG GTT 1302 
Met Gly Leu Leu Gly Pro Thr He Gin Ala Glu Val 
90 95 

GTC ATT ACA CTT AAG AAC ATG GCT TCC CAT CCT GTC 1350 
Val He Thr Leu Lys Asn Met Ala Ser His Pro Val 
105 110 115 

GTT GGT GTA TCC TAC TGG AAA GCT TCT GAG v GGA GCT 1398 
Val Gly Val Ser Tyr Trp Lys Ala Ser Glu Gly Ala 
120 125 130 
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GAA TAT GAT GAT CAG ACC AGT CAA AGG GAG AAA GAA GAT GAT AAA GTC 14 4 6 

Glu Tyr Asp Asp Gin Thr Ser Gin Arg Glu Lys Glu Asp Asp Lys Val 
135 140 145 

TTC CCT GGT GGA AGC CAT ACA TAT GTC TGG CAG GTC CTG AAA GAG AAT 14 94 

Phe Pro Gly Gly Ser His Thr Tyr Val Trp Gin Val Leu Lys Glu Asn 
150 155 160 

GGT CCA ATG GCC TCT GAC CCA CTG TGC CTT ACC TAC TCA TAT CTT TCT 154 2 

Gly Pro Met Ala Ser Asp Pro Leu Cys Leu Thr Tyr Ser Tyr Leu Ser 
165 170 175 

CAT GTG GAC CTG GTA AAA GAC TTG AAT TCA GGC CTC ATT GGA GCC CTA 15 90 

His Val Asp Leu Val Lys Asp Leu Asn Ser Gly Leu lie Gly Ala Leu 
180 185 190 195 

CTA GTA TGT AGA GAA GGG AGT CTG GCC AAG GAA AAG ACA CAG ACC TTG 1638 
Leu Val Cys Arg Glu Gly Ser Leu Ala Lys Glu Lys Thr Gin Thr Leu 
200 205 210 

CAC AAA TTT ATA CTA CTT TTT GCT GTA TTT GAT GAA GGG AAA AGT TGG 168 6 

His Lys Phe lie Leu Leu Phe Ala Val Phe Asp Glu Gly Lys Ser Trp 
215 220 225 

CAC TCA GAA ACA AAG AAC TCC TTG ATG CAG GAT AGG GAT GCT GCA TCT 17 34 

His Ser Glu Thr Lys Asn Ser Leu Met Gin Asp Arg Asp Ala Ala Ser 
230 235 240 

GCT CGG GCC TGG CCT AAA ATG CAC ACA GTC AAT GGT TAT GTA AAC AGG 17 82 

Ala Arg Ala Trp Pro Lys Met His Thr Val Asn Gly Tyr Val Asn Arg 
245 250 255 . 

TCT CTG CCA GGT CTG ATT GGA TGC CAC AGG AAA TCA GTC TAT TGG CAT 1830 
Ser Leu Pro Gly Leu lie Gly Cys His Arg Lys Ser Val Tyr Trp His 
260 265 270 275 

GTG ATT GGA ATG GGC ACC ACT CCT GAA GTG CAC TCA ATA TTC CTC GAA 1878 
c Val lie Gly Met Gly Thr Thr Pro Glu Val His Ser lie Phe Leu Glu 
280 285 290 

GGT CAC ACA TTT CTT GTG AGG AAC CAT CGC CAG GCG TCC TTG GAA ATC 192 6 

Gly His Thr Phe Leu Val Arg Asn His Arg Gin Ala Ser Leu Glu lie 
295 300 305 

TCG CCA ATA ACT TTC CTT ACT GCT CAA ACA CTC TTG ATG GAC CTT GGA 197 4 

Ser Pro lie Thr Phe Leu Thr Ala Gin Thr Leu Leu Met Asp Leu Gly 
310 315 320 

CAG TTT CTA CTG TTT TGT CAT ATC TCT TCC CAC CAA CAT GAT GGC ATG 2022 
Gin Phe Leu Leu Phe Cys His lie Ser Ser His Gin His Asp Gly Met 
325 330 335 

GAA GCT TAT GTC AAA GTA GAC AGC TGT CCA GAG GAA CCC CAA CTA CGA 207 0 

Glu Ala Tyr Val Lys Val Asp Ser Cys Pro Glu Glu Pro Gin Leu Arg 
340 345 350 355 
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ATG AAA AAT AAT GAA GAA GCG GAA GAC TAT GAT GAT GAT CTT ACT GAT 2118 
Met Lys Asn Asn Glu Glu Ala Glu Asp Tyr Asp Asp Asp Leu Thr Asp 
360 365 370 

TCT GAA ATG GAT GTG GTC AGG TTT GAT GAT GAC AAC TCT CCT TCC TTT 2166 
Ser Glu Met Asp Val Val Arg Phe Asp Asp Asp Asn Ser Pro Ser Phe 
375 380 385 

ATC CAA ATT CGC TCA GTT GCC AAG AAG CAT CCT AAA ACT TGG GTA CAT 2214 
lie Gin lie Arg Ser Val Ala Lys Lys His Pro Lys Thr Trp Val His 
390 395 400 

TAC ATT GCT GCT GAA GAG GAG GAC TGG GAC TAT GCT CCC TTA GTC CTC 22 62 

Tyr lie Ala Ala Glu Glu Glu Asp Trp Asp Tyr Ala Pro Leu Val Leu 
405 410 415 

GCC CCC GAT GAC AGA AGT TAT AAA AGT CAA TAT TTG AAC AAT GGC CCT 2 310 

Ala Pro Asp Asp Arg Ser Tyr Lys Ser Gin Tyr Leu Asn Asn Gly Pro 
420 425 430 435 

CAG CGG ATT GGT AGG AAG TAC AAA AAA GTC CGA TTT ATG GCA TAC ACA 2358 
Gin Arg lie Gly Arg Lys Tyr Lys Lys Val Arg Phe Met Ala Tyr Thr 
440 445 450 

GAT GAA ACC TTT AAG ACT CGT GAA GCT ATT CAG CAT GAA TCA GGA ATC 2 4 06 

Asp Glu Thr Phe Lys Thr Arg Glu Ala lie Gin His Glu Ser Gly lie 
455 460 465 

TTG GGA CCT TTA CTT TAT GGG GAA GTT GGA GAC ACA CTG TTG ATT ATA 2 4 54 

Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Leu Leu lie lie 
470 475 480 

TTT AAG AAT CAA GCA AGC AGA CCA TAT AAC ATC TAC CCT CAC GGA ATC 2502 
Phe Lys Asn Gin Ala Ser Arg Pro Tyr Asn lie Tyr Pro His Gly lie 
485 490 495 

ACT GAT GTC CGT CCT TTG TAT TCA AGG AGA TTA CCA AAA GGT GTA AAA 2550 
Thr Asp Val Arg Pro Leu Tyr Ser Arg Arg Leu Pro Lys Gly Val Lys 
500 505 510 515 

CAT TTG AAG GAT TTT CCA ATT CTG CCA GGA GAA ATA TTC AAA TAT AAA 25 98 
His Leu Lys Asp Phe Pro lie Leu Pro Gly Glu lie Phe Lys Tyr Lys 
520 525 530 

TGG ACA GTG ACT GTA GAA GAT GGG CCA ACT AAA TCA GAT CCT CGG TGC 2 64 6 

Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser Asp Pro Arg Cys 
535 540 545 

CTG ACC CGC TAT TAC TCT AGT TTC GTT AAT ATG GAG AGA GAT CTA GCT 2 694 

Leu Thr Arg Tyr Tyr Ser Ser Phe Val Asn Met Glu Arg Asp Leu Ala 
550 555 560 

TCA GGA CTC ATT GGC CCT CTC CTC ATC TGC TAC AAA GAA TCT GTA GAT 27 4 2 

Ser Gly Leu lie Gly Pro Leu Leu lie Cys Tyr Lys Glu Ser Val Asp 
565 570 575 
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^^ATA ATG TCA GAC AAG AGG AAT B 



CAA AGA GGA AAC CAG ATA ATG TCA GAC AAG AGG AAT GTCATC CTG TTT 27 90 

Gin Arg Gly Asn Gin lie Met Ser Asp Lys Arg Asn Val lie Leu Phe 
580 585 590 595 

TCT GTA TTT GAT GAG AAC CGA AGC TGG TAC CTC ACA GAG AAT ATA CAA 2838 
Ser Val Phe Asp Glu Asn Arg Ser Trp Tyr Leu Thr Glu Asn lie Gin 
600 605 610 

CGC TTT CTC CCC AAT CCA GCT GGA GTG CAG CTT GAG GAT CCA GAG TTC 288 6 

Arg Phe Leu Pro Asn Pro Ala Gly Val Gin Leu Glu Asp Pro Glu Phe 
615 620 * 625 

CAA GCC TCC AAC ATC ATG CAC AGC ATC AAT GGC TAT GTT TTT GAT AGT 2 934 

Gin Ala Ser Asn lie Met His Ser lie Asn Gly Tyr Val Phe Asp Ser 
630 635 640 

TTG CAG TTG TCA GTT TGT TTG CAT GAG GTG GCA TAC TGG TAC ATT CTA 2982 
Leu Gin Leu Ser Val Cys Leu His Glu Val Ala Tyr Trp Tyr lie Leu 
645 650 655 

AGC ATT GGA GCA CAG ACT GAC TTC CTT TCT GTC TTC TTC TCT GGA TAT 3030 
Ser lie Gly Ala Gin Thr Asp Phe Leu Ser Val Phe Phe Ser Gly Tyr 
660 665 670 675 

ACC TTC AAA CAC AAA ATG GTC TAT GAA GAC ACA CTC ACC CTA TTC CCA 307 8 

Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu Thr Leu Phe Pro 
680 685 690 

TTC TCA GGA GAA ACT GTC TTC ATG TCG ATG GAA AAC CCA GGT CTA TGG 312 6 

Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn Pro Gly Leu Trp 
695 700 705 

ATT CTG GGG TGC CAC AAC TCA GAC TTT CGG AAC AGA GGC ATG ACC GCC 317 4 

lie Leu Gly Cys His Asn Ser Asp Phe Arg Asn Arg Gly Met Thr Ala 
710 . 715 720 

TTA CTG AAG GTT TCT AGT TGT GAC AAG AAC ACT GGT GAT TAT TAC GAG 3222 
Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr Gly Asp Tyr Tyr Glu 
725 730 735 

GAC AGT TAT GAA GAT ATT TCA GCA TAC TTG CTG AGT AAA AAC AAT GCC 327 0 

Asp Ser Tyr Glu Asp lie Ser Ala Tyr Leu Leu Ser Lys Asn Asn Ala 
740 745 750 755 

ATT GAA CCA AGA AGC TTC TCC CAG AAC CCA CCA GTC TTG AAA CGC CAT 3318 
lie Glu Pro Arg Ser Phe Ser Gin Asn Pro Pro Val Leu Lys Arg His 
760 765 770 

CAA CGG GAA ATA ACT CGT ACT ACT CTT CAG TCA GAT CAA GAG GAA ATT 3366 
Gin Arg Glu lie Thr Arg Thr Thr Leu Gin Ser Asp Gin Glu Glu lie 
775 780 785 

GAC TAT GAT GAT ACC ATA TCA GTT GAA ATG AAG AAG GAA GAT TTT GAC 3414 
Asp Tyr Asp Asp Thr lie Ser Val Glu Met Lys Lys Glu Asp Phe Asp 
790 795 800 
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ATT TAT GAT GAG GAT GAA AAT CAG AGC CCC CGC AGC TTT CAA AAG AAA 34 62 
lie Tyr Asp Glu Asp Glu Asn Gin Ser Pro Arg Ser Phe Gin Lys Lys 
805 810 815 

ACA CGA CAC TAT TTT ATT GCT GCA GTG GAG AGG CTC TGG GAT TAT GGG 3510 
Thr Arg His Tyr Phe lie Ala Ala Val Glu Arg Leu Trp Asp Tyr Gly 
820 825 830 835 

ATG AGT AGC TCC CCA CAT GTT CTA AGA AAC AGG GCT CAG AGT GGC AGT 3558 
Met Ser Ser Ser Pro His Val Leu Arg Asn Arg Ala Gin Ser Gly Ser 
840 845 850 

GTC CCT CAG TTC AAG AAA GTT GTT TTC CAG GAA TTT ACT GAT GGC TCC 3606 
Val Pro Gin Phe Lys Lys Val Val Phe Gin Glu Phe Thr Asp Gly Ser 
855 860 865 

TTT ACT CAG CCC TTA TAC CGT GGA GAA CTA AAT GAA CAT TTG GGA CTC 3654 
Phe Thr Gin Pro Leu Tyr Arg Gly Glu Leu Asn Glu His Leu Gly Leu 
870 875 880 

CTG GGG CCA TAT ATA AGA GCA GAA GTT GAA GAT AAT ATC ATG GTA ACT 37 02 

Leu Gly Pro Tyr lie Arg Ala Glu Val Glu Asp Asn lie Met Val Thr 
885 890 895 

TTC AGA AAT CAG GCC TCT CGT CCC TAT TCC TTC TAT TCT AGC CTT ATT 3750 
Phe Arg Asn Gin Ala Ser Arg Pro Tyr Ser Phe Tyr Ser Ser Leu lie 
900 905 910 915 

TCT TAT GAG GAA GAT CAG AGG CAA GGA GCA GAA CCT AGA AAA AAC TTT 37 98 

Ser Tyr Glu Glu Asp Gin Arg Gin Gly Ala Glu Pro Arg Lys Asn Phe 
920 925 930 

GTC AAG CCT AAT GAA ACC AAA ACT TAC TTT TGG AAA GTG CAA CAT CAT 384 6 

Val Lys Pro Asn Glu Thr Lys Thr Tyr Phe Trp Lys Val Gin His His 
935 940 945 

ATG GCA CCC ACT AAA GAT GAG TTT GAC TGC AAA GCC TGG GCT TAT TTC 38 94 

Met Ala Pro Thr Lys Asp Glu Phe Asp Cys Lys Ala Trp Ala Tyr Phe 
950 955 960 

TCT GAT GTT GAC CTG GAA AAA GAT GTG CAC TCA GGC CTG ATT GGA CCC 394 2 

Ser Asp Val Asp Leu Glu Lys Asp Val His Ser Gly Leu lie Gly Pro 
965 970 975 

CTT CTG GTC TGC CAC ACT AAC ACA CTG AAC CCT GCT CAT GGG AGA CAA 3990 
Leu Leu Val Cys His Thr Asn Thr Leu Asn Pro Ala His Gly Arg Gin 
980 985 990 995 

GTG ACA GTA CAG GAA TTT GCT CTG TTT TTC ACC ATC TTT GAT GAG ACC 4 038 

Val Thr Val Gin Glu Phe Ala Leu Phe Phe Thr He Phe Asp Glu Thr 
1000 1005 1010 

AAA AGC TGG TAC TTC ACT GAA AAT ATG GAA AGA AAC TGC AGG GCT CCC 4 08 6 

Lys Ser Trp Tyr Phe Thr Glu Asn Met Glu Arg Asn Cys Arg Ala Pro 
1015 1020 1025 
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TGC AAT ATC CAG ATG GAA GAT CCC ACT TTT AAA GAG AAT TAT CGC TTC 4134 
Cys Asn lie Gin Met Glu Asp Pro Thr Phe Lys Glu Asn Tyr Arg Phe 
1030 1035 1040 

CAT GCA ATC AAT GGC TAC ATA ATG GAT ACA CTA CCT GGC TTA GTA ATG 4182 
His Ala lie Asn Gly Tyr lie Met Asp Thr Leu Pro Gly Leu Val Met 
1045 1050 1055 

GCT CAG GAT CAA AGG ATT CGA TGG TAT CTG CTC AGC ATG GGC AGC AAT 4 230 

Ala Gin Asp Gin Arg lie Arg Trp Tyr Leu Leu Ser Met Gly Ser Asn 
1060 1065 1070 1075 

GAA AAC ATC CAT TCT ATT CAT TTC AGT GGA CAT GTG TTC ACT GTA CGA 4 27 8 

Glu Asn lie His Ser lie His Phe Ser Gly His Val Phe Thr Val Arg 
1080 1085 1090 

AAA AAA GAG GAG TAT AAA ATG GCA CTG TAC AAT CTC TAT CCA GGT GTT 4 32 6 

Lys Lys Glu Glu Tyr Lys Met Ala Leu Tyr Asn Leu Tyr Pro Gly Val 
1095 1100 1105 

TTT GAG ACA GTG GAA ATG TTA CCA TCC AAA GCT GGA ATT TGG CGG GTG 4 37 4 

Phe Glu Thr Val Glu Met Leu Pro Ser Lys Ala Gly lie Trp Arg Val 
1110 1115 1120 

GAA TGC CTT ATT GGC GAG CAT CTA CAT GCT GGG ATG AGC ACA CTT TTT 4 4 22 

Glu Cys Leu lie Gly Glu His Leu His Ala Gly Met Ser Thr Leu Phe 
1125 1130 1135 

CTG GTG TAC AGC AAT AAG TGT CAG ACT CCC CTG GGA ATG GCT TCT GGA 4 4 70 

Leu Val Tyr Ser Asn Lys Cys Gin Thr Pro Leu Gly Met Ala Ser Gly 
1140 1145 1150 1155 

CAC ATT AGA GAT TTT CAG ATT ACA GCT TCA GGA CAA TAT GGA CAG TGG 4 518 

His lie Arg Asp Phe Gin lie Thr Ala Ser Gly Gin Tyr Gly Gin Trp 
1160 1165 1170 

GCC CCA AAG CTG GCC AGA CTT CAT TAT TCC GGA TCA ATC AAT GCC TGG 4 5 66 

Ala Pro Lys Leu Ala Arg Leu His Tyr Ser Gly Ser lie Asn Ala Trp 
1175 1180 1185 

AGC ACC AAG GAG CCC TTT TCT TGG ATC AAG GTG GAT CTG TTG GCA CCA 4 614 

Ser Thr Lys Glu Pro Phe Ser Trp lie Lys Val Asp Leu Leu Ala Pro 
1190 1195 1200 

ATG ATT ATT CAC GGC ATC AAG ACC CAG GGT GCC CGT CAG AAG TTC TCC 4 662 

Met lie lie His Gly lie Lys Thr Gin Gly Ala Arg Gin Lys Phe Ser 
1205 1210 1215 

AGC CTC TAC ATC TCT CAG TTT ATC ATC ATG TAT AGT CTT GAT GGG AAG 4 710 

Ser Leu Tyr lie Ser Gin Phe lie lie Met Tyr Ser Leu Asp Gly Lys 
1220 1225 1230 1235 

AAG TGG CAG ACT TAT CGA ■ GGA AAT TCC ACT GGA ACC TTA ATG GTC TTC 4 7 58 

Lys Trp Gin Thr Tyr Arg Gly Asn Ser Thr Gly Thr Leu Met Val Phe 
1240 1245 1250 
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TTT GGC AAT GTG GAT TCA TCT GGG ATA AAA CAC AAT ATT TTT AAC CCT 4 806 

Phe Gly Asn Val Asp Ser Ser Gly lie Lys His Asn lie Phe Asn Pro 
1255 1260 1265 

CCA ATT ATT GCT CGA TAC ATC CGT TTG CAC CCA ACT CAT TAT AGC ATT 4 854 

Pro lie lie Ala Arg Tyr lie Arg Leu His Pro Thr His Tyr Ser lie 
1270 1275 1280 

CGC AGC ACT CTT CGC ATG GAG TTG ATG GGC TGT GAT TTA AAT AGT TGC 4 902 

Arg Ser Thr Leu Arg Met Glu Leu Met Gly Cys Asp Leu. Asn Ser Cys 
1285 1290 1295 

AGC ATG CCA TTG GGA ATG GAG AGT AAA GCA ATA TCA GAT GCA CAG ATT 4 950 

Ser Met Pro Leu Gly Met Glu Ser Lys Ala lie Ser Asp Ala Gin lie 
1300 1305 1310 1315 

ACT GCT TCA TCC TAC TTT ACC AAT ATG TTT GCC ACC TGG TCT CCT TCA 4 998 

Thr Ala Ser Ser Tyr Phe Thr Asn Met Phe Ala Thr Trp Ser Pro Ser 
1320 1325 1330 

AAA GCT CGA CTT CAC CTC CAA GGG AGG AGT AAT GCC TGG AGA CCT CAG 504 6 

Lys Ala Arg Leu His Leu Gin Gly Arg Ser Asn Ala Trp Arg Pro Gin 
1335 1340 1345 

GTG AAT AAT CCA AAA GAG TGG CTG CAA GTG GAC TTC CAG AAG ACA ATG 50 94 

Val Asn Asn Pro Lys Glu Trp Leu Gin Val Asp Phe Gin Lys Thr Met 
1350 1355 1360 

AAA GTC ACA GGA GTA ACT ACT CAG GGA GTA AAA TCT CTG CTT ACC AGC 514 2 

Lys Val Thr Gly Val Thr Thr Gin Gly Val Lys Ser Leu Leu Thr Ser 
1365 1370 1375 

ATG TAT GTG AAG GAG TTC CTC ATC TCC AGC AGT CAA GAT GGC CAT CAG 5190 
Met Tyr Val Lys Glu Phe Leu lie Ser Ser Ser Gin Asp Gly His Gin 
1380 1385 1390 1395 

TGG ACT CTC TTT TTT CAG AAT GGC AAA GTA AAG GTT TTT CAG GGA AAT 5238 
Trp Thr Leu Phe Phe Gin Asn Gly Lys Val Lys Val Phe Gin Gly Asn 
1400 1405 1410 

CAA GAC TCC TTC ACA CCT GTG GTG AAC TCT CTA GAC CCA CCG TTA CTG 528 6 

Gin Asp Ser Phe Thr Pro Val Val Asn Ser Leu Asp Pro Pro Leu Leu 
1415 1420 1425 

' ACT CGC TAC CTT CGA ATT CAC CCC CAG AGT TGG GTG CAC CAG ATT GCC 5334 
Thr Arg Tyr Leu Arg lie His Pro Gin Ser Trp Val His Gin lie Ala 
1430 1435 1440 

CTG AGG ATG GAG GTT CTG GGC TGC GAG GCA CAG GAC CTC TAC 537 6 

Leu Arg Met Glu Val Leu Gly Cys Glu Ala Gin Asp Leu Tyr 
1445 1450 1455 

TGAGGGTGGC CAC TGC AGC A CCTGCCACTG CCGTCACCTC TCCCTCCTCA GCTCCAGGGC 54 36 

AGTGTCCCTC CCTGGCTTGC CTTCTACCTT TGTGCTAAAT CCTAGCAGAC ACTGCCTTGA 54 96 

AGCCTCCTGA ATTAACTATC ATCAGTCCTG CATTTCTTTG GTGGGGGGCC AGGAGGGTGC 555 6 
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ATCCAATTTA ACTTAACTCT TACCTATTTT CTGCAGCTGC TCCCAGATTA CTCCTTCCTT 5616 

CCAATATAAC TAGGCAAAAA GAAGTGAGGA GAAACCTGCA T G AAAG CAT T CTTCCCTGAA 5 67 6 

AAGTTAGGCC TCTCAGAGTC ACCACTTCCT CTGTTGTAGA AAAACTATGT GATGAAACTT 5736 

TGAAAAAGAT ATT TAT GAT G TTAACTTGTT TATTGCAGCT TATAATGGTT ACAAATAAAG 57 96 

CAATAGCATC ACAAATTTCA CAAATAAAGC ATTTTTTTCA CTGCATTCTA GTTGTGGTTT 585 6 

GTCCAAACTC ATCAATGTAT CTTATCATGT CTGGATCCCC GGGTGGCATC CCTGTGACCC 5 916 

CTCCCCAGTG CCTCTCCTGG CCCTGGAAGT TGCCACTCCA GTGCCCACCA GCCTTGTCCT 5 97 6 

AATAAAATTA AGTTGCATCA TTTTGTCTGA CTAGGTGTCC TTCTATAATA TTATGGGGTG 6036 

GAGGGGGGTG GTATGGAGCA AGGGGCAAGT TGGGAAGACA ACCTGTAGGG CCTGCGGGGT 6096 

CTATTCGGGA ACCAAGCTGG AGTGCAGTGG CACAATCTTG GCTCACTGCA ATCTCCGCCT 6156 

CCTGGGTTCA AGCGATTCTC CTGCCTCAGC CTCCCGAGTT GTTGGGATTC CAGGCATGCA 6216 

TGACCAGGCT CAGCTAATTT TTGTTTTTTT GGTAGAGACG GGGTTTCACC ATATTGGCCA 627 6 

GGCTGGTCTC CAACTCCTAA TCTCAGGTGA TCTACCCACC TTGGCCTCCC AAATTGCTGG 6336 

GATTACAGGC GTGAACCACT GCTCCCTTCC CTGTCCTTCT GATTTTAAAA TAACTATACC 6396 

AGCAGGAGGA CGTCCAGACA CAGCATAGGC TACCTGCCAT GCCCAACCGG TGGGACATTT 64 56 

GAGTTGCTTG CTTGGCACTG TCCTCTCATG CGTTGGGTCC ACTCAGTAGA TGCCTGTTGA 6516 

ATTCGTAATC ATGGTCATAG CTGTTTCCTG TGTGAAATTG TTATCCGCTC ACAATTCCAC 657 6 

ACAACATACG AGCCGGAAGC ATAAAGTGTA AAGCCTGGGG TGCCTAATGA GTGAGCTAAC 6636 

TCACATTAAT TGCGTTGCGC TCACTGCCCG CTTTCCAGTC GGGAAACCTG TCGTGCCAGC 6696 

TGCATTAATG AATCGGCCAA CGCGCGGGGA GAGGCGGTTT GCGTATTGGG CGCTCTTCCG 67 56 

CTTCCTCGCT CACTGACTCG CTGCGCTCGG TCGTTCGGCT GCGGCGAGCG GTATCAGCTC 6816 

ACTCAAAGGC GGTAATACGG TTATCCACAG AATCAGGGGA TAACGCAGGA AAGAACATGT 687 6 

GAGCAAAAGG CCAGCAAAAG GCCAGGAACC GTAAAAAGGC CGCGTTGCTG GCGTTTTTCC 69 36 

ATAGGCTCCG CCCCCCTGAC GAGCATCACA AAAATCGACG CTCAAGTCAG AGGTGGCGAA 6996 

ACCCGACAGG ACTATAAAGA TACCAGGCGT TTCCCCCTGG AAGCTCCCTC GTGCGCTCTC 7 05 6 

CTGTTCCGAC CCTGCCGCTT ACCGGATACC TGTCCGCCTT TCTCCCTTCG GGAAGCGTGG 7116 

CGCTTTCTCA TAGCTCACGC TGTAGGTATC TCAGTTCGGT GTAGGTCGTT CGCTCCAAGC 717 6 

TGGGCTGTGT GCACGAACCC CCCGTTGAGC CCGACCGCTG CGCCTTATCC GGTAACTATC 7 2 36 

GTCTTGAGTC CAACCCGGTA AGACACGACT TATCGCCACT GGCAGCAGCC ACTGGTAACA 72 96 
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GGATTAGCAG AGCGAGGTAT GTAGGCGGTG CTACAGAGTT CTTGAAGTGG TGGCCTAACT 7 35 6 

ACGGCTACAC T AG AAG G AC A GTATTTGGTA TCTGCGCTCT GCTGAAGCCA GTTACCTTCG 7416 

G AAAAAG AG T TGGTAGCTCT TGATCCGGCA AACAAACCAC CGCTGGTAGC GGTGGTTTTT 74 7 6 

TTGTTTGCAA GCAGCAGATT ACGCGCAGAA AAAAAGGATC TC AAG AAG AT CCTTTGATCT 7 53 6 

TTTCTACGGG GTCTGACGCT CAGTGGAACG AAAACTCACG TTAAGGGATT TTGGTCATGA 7 5 96 

GATTATCAAA AAGGATCTTC ACCTAGATCC TTTTAAATTA AAAATGAAGT TTTAAATCAA 7 65 6 

TCTAAAGTAT ATATGAGTAA ACTTGGTCTG ACAGTTACCA ATGCTTAATC AGTGAGGCAC 7716 

CTATCTCAGC GATCTGTCTA TTTCGTTCAT CCATAGTTGC CTGACTCCCC GTCGTGTAGA 77 7 6 

TAACTACGAT ACGGGAGGGC TTACCATCTG GCCCCAGTGC TGCAATGATA CCGCGAGACC 7 8 36 

CACGCTCACC GGCTCCAGAT TTATCAGCAA TAAACCAGCC AGCCGGAAGG GCCGAGCGCA 7 8 96 

GAAGTGGTCC TGCAACTTTA TCCGCCTCCA TCCAGTCTAT TAATTGTTGC CGGGAAGCTA 7 95 6 

GAGTAAGTAG TTCGCCAGTT AATAGTTTGC GCAACGTTGT TGCCATTGCT ACAGGCATCG 8016 

TGGTGTCACG CTCGTCGTTT GGTATGGCTT CATTCAGCTC CGGTTCCCAA C GAT C AAG G C 8 07 6 

GAGTTACATG ATCCCCCATG TTGTGCAAAA AAGCGGTTAG CTCCTTCGGT CCTCCGATCG 813 6 

T T G T C AG AAG TAAGTTGGCC GCAGTGTTAT CACTCATGGT TATGGCAGCA CTGCATAATT 8196 

CTCTTACTGT CATGCCATCC GTAAGATGCT TTTCTGTGAC TGGTGAGTAC TCAACCAAGT 825 6 

CATTCTGAGA ATAGTGTATG CGGCGACCGA GTTGCTCTTG CCCGGCGTCA ATACGGGATA 8 316 

ATACCGCGCC ACATAGCAGA ACTTTAAAAG TGCTCATCAT TGGAAAACGT TCTTCGGGGC 8 37 6 

GAAAACTCTC AAGGATCTTA CCGCTGTTGA GATCCAGTTC GATGTAACCC ACTCGTGCAC 8 4 36 

CCAACTGATC TTCAGCATCT TTTACTTTCA CCAGCGTTTC TGGGTGAGCA AAAACAGGAA 8 4 96 

GGCAAAATGC CGCAAAAAAG GGAATAAGGG CGACACGGAA ATGTTGAATA CTCATACTCT 8 556 

TCCTTTTTCA ATATTATTGA AGCATTTATC AGGGTTATTG TCTCATGAGC G GAT AC AT AT 8 616 

TTGAATGTAT T T AG AAAAAT AAACAAATAG GGGTTCCGCG CACATTTCCC CGAAAAGTGC 8 67 6 

CACCTGACGT CTAAGAAACC ATTATTATCA T G AC AT T AAC CTAT AAAAAT AGGCGTATCA 87 36 

CGAGGCCCTT TCGTCTCGCG CGTTTCGGTG ATGACGGTGA AAACCTCTGA CACATGCAGC 87 96 

TCCCGGAGAC GGTCACAGCT TGTCTGTAAG CGGATGCCGG GAGCAGACAA GCCCGTCAGG 88 56 

GCGCGTCAGC GGGTGTTGGC GGGTGTCGGG GCTGGCTTAA CTATGCGGCA TCAGAGCAGA 8 916 

TTGTACTGAG AGTGCACCAT- ATGCGGTGTG AAATACCGCA CAGATGCGTA AGGAGAAAAT 8 97 6 

ACCGCATCAG GCGCCATTCG CCATTCAGGC TGCGCAACTG TTGGGAAGGG CGATCGGTGC 903 6 
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GGGCCTCTTC GCTATTACGC CAGCTGGCGA AAGGGGGATG TGCTGCAAGG CGATTAAGTT 9096 
GGGTAACGCC AGGGTTTTCC CAGTCACGAC GTTGTAAAAC GACGGCCAGT GCCAAGCTTG 915 6 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12022 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1006.. 3294 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 6153.. 8234 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

GTCGACGGTA TCGATAAGCT TGATATCGAA TTCCTGCAGC CCGGGGGATC CACTAGTACT 60 

CGAGACCTAG GAGTTAATTT TTAAAAAGCA GTCAAAAGTC CAAGTGGCCC TTGCGAGCAT 120 

TTACTCTCTC TGTTTGCTCT GGTTAATAAT CTCAGGAGCA CAAACATTCC TTACTAGTCC 18 0 

TAGAAGTTAA TTTTTAAAAA GCAGTCAAAA GTCCAAGTGG CCCTTGCGAG CATTTACTCT 24 0 

CTCTGTTTGC TCTGGTTAAT AATCTCAGGA GCACAAACAT TCCTTACTAG TTCTAGAGCG 300 

GCCGCCAGTG TGCTGGAATT CGGCTTTTTT AGGGCTGGAA GCTACCTTTG ACATCATTTC 360 

CTCTGCGAAT GCATGTATAA TTTCTACAGA AC C TAT TAG A AAGGATCACC CAGCCTCTGC 4 20 

TTTTGTACAA CTTTCCCTTA AAAAACTGCC AATTCCACTG CTGTTTGGCC C AAT AG T GAG 4 80 

AACTTTTTCC TGCTGCCTCT TGGTGCTTTT GCCTATGGCC CCTATTCTGC CTGCTGAAGA 54 0 

CACTCTTGCC AGCATGGACT TAAACCCCTC CAGCTCTGAC AATCCTCTTT CTCTTTTGTT 600 

TTACATGAAG GGTCTGGCAG CCAAAGCAAT CACTCAAAGT TCAAACCTTA TCATTTTTTG 660 

CTTTGTTCCT CTTGGCCTTG GTTTTGTACA TCAGCTTTGA AAATACCATC CCAGGGTTAA 720 

TGCTGGGGTT AATTTATAAC TAAGAGTGCT CTAGTTTTGC AAT AC AG G AC ATGCTATAAA 7 80 

AATGGAAAGA TGTTGCTTTC TGAGAGATCT CGAGGAAGCT AACAACAAAG AACAACAAAC 84 0 

AACAATCAGG TAAGTATCCT TTTTACAGCA CAACTTAATG AGACAGATAG AAACTGGTCT 900 

TGTAGAAACA GAGTAGTCGC CTGCTTTTCT GCCAGGTGCT GACTTCTCTC CCCTTCTCTT 960 



GGCTGCAG 



9164 



18 



TTTTCCTTTT CTCAGGATAA CAAGAACGAA ACAATAACAG CCACC ATG GAA ATA 1014 

Met Glu lie 
1 

GAG CTC TCC ACC TGC TTC TTT CTG TGC CTT TTG CGA TTC TGC TTT AGT 10 62 

Glu Leu Ser Thr Cys Phe Phe Leu Cys Leu Leu Arg Phe Cys Phe Ser 
5 10 15 

GCC ACC AGA AGA TAC TAC CTG GGT GCA GTG GAA CTG TCA TGG GAC TAT 1110 
Ala Thr Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu Ser Trp Asp Tyr 
20 25 30 35 

ATG CAA AGT GAT CTC GGT GAG CTG CCT GTG GAC GCA AGA TTT CCT CCT 1158 
Met Gin Ser Asp Leu Gly Glu Leu Pro Val Asp Ala Arg Phe Pro Pro 
40 45 50 

AGA GTG CCA AAA TCT TTT CCA TTC AAC ACC TCA GTC GTG TAC AAA AAG 120 6 

Arg Val Pro Lys Ser Phe Pro Phe Asn Thr Ser Val Val Tyr Lys Lys 
55 60 65 

ACT CTG TTT GTA GAA TTC ACG GTT CAC CTT TTC AAC ATC GCT AAG CCA 1254 
Thr Leu Phe Val Glu Phe Thr Val His Leu Phe Asn lie Ala Lys Pro 

70 75 80 

AGG CCA CCC TGG ATG GGT CTG CTA GGT CCT ACC ATC CAG GCT GAG GTT 1302 
Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr lie Gin Ala Glu Val 
85 90 95 

TAT GAT ACA GTG GTC ATT ACA CTT AAG AAC ATG GCT TCC CAT CCT GTC 1350 
Tyr Asp Thr Val Val lie Thr Leu Lys Asn Met Ala Ser His Pro Val 
100 105 110 . 115 

AGT CTT CAT GCT GTT GGT GTA TCC TAC TGG AAA GCT TCT GAG GGA GCT 1398 
Ser Leu His Ala Val Gly Val Ser Tyr Trp Lys Ala Ser Glu Gly Ala 
120 125 130 

GAA TAT GAT GAT CAG ACC AGT CAA AGG GAG AAA GAA GAT GAT AAA GTC 14 4 6 

Glu Tyr Asp Asp Gin Thr Ser Gin Arg Glu Lys Glu Asp Asp Lys Val 
135 140 145 

TTC CCT GGT GGA AGC CAT ACA TAT GTC TGG CAG GTC CTG AAA GAG AAT 14 94 

Phe Pro Gly Gly Ser His Thr Tyr Val Trp Gin Val Leu Lys Glu Asn 
150 155 160 

GGT CCA ATG GCC TCT GAC CCA CTG TGC CTT ACC TAC TCA TAT CTT TCT 15 4 2 

Gly Pro Met Ala Ser Asp Pro Leu Cys Leu Thr Tyr Ser Tyr Leu Ser 
165 170 175 

CAT GTG GAC CTG GTA AAA GAC TTG AAT TCA GGC CTC ATT GGA GCC CTA 15 90 

His Val Asp Leu Val Lys Asp Leu Asn Ser Gly Leu lie Gly Ala Leu 
180 185 190 195 

CTA GTA TGT AGA GAA GGG AGT CTG GCC AAG GAA AAG ACA CAG ACC TTG 1638 
Leu Val Cys Arg Glu Gly Ser Leu Ala Lys Glu Lys Thr Gin Thr Leu 
200 205 210 
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CAC AAA TTT ATA CTA CTT TTT GCT GTA TTT GAT GAA GGG AAA AGT TGG 168 6 

His Lys Phe lie Leu Leu Phe Ala Val Phe Asp Glu Gly Lys Ser Trp 
215 220 225 

CAC TCA GAA ACA AAG AAC TCC TTG ATG CAG GAT AGG GAT GCT GCA TCT 17 34 

His Ser Glu Thr Lys Asn Ser Leu Met Gin Asp Arg Asp Ala Ala Ser 
230 235 240 



GCT CGG GCC TGG CCT AAA ATG CAC ACA GTC AAT GGT TAT GTA AAC AGG 17 82 

Ala Arg Ala Trp Pro Lys Met His Thr Val Asn Gly Tyr Val Asn Arg 
245 250 255 

TCT CTG CCA GGT CTG ATT GGA TGC CAC AGG AAA TCA GTC TAT TGG CAT 18 30 

Ser Leu Pro Gly Leu lie Gly Cys His Arg Lys Ser Val Tyr Trp His 
260 265 270 275 



GTG ATT GGA ATG GGC ACC ACT CCT GAA GTG CAC TCA ATA TTC CTC GAA 187 8 

Val lie Gly Met Gly Thr Thr Pro Glu Val His Ser lie Phe Leu Glu 
280 285 290 



GGT CAC ACA TTT CTT GTG AGG AAC CAT CGC CAG GCG TCC TTG GAA ATC 192 6 

Gly His Thr Phe Leu Val Arg Asn His Arg Gin Ala Ser Leu Glu lie 
295 300 305 



TCG CCA ATA ACT TTC CTT ACT GCT CAA ACA CTC TTG ATG GAC CTT GGA 197 4 

Ser Pro lie Thr Phe Leu Thr Ala Gin Thr Leu Leu Met Asp Leu Gly 
310 315 320 



CAG TTT CTA CTG TTT TGT CAT ATC TCT TCC CAC CAA CAT GAT GGC ATG 2022 
Gin Phe Leu Leu Phe Cys His lie Ser Ser His Gin His Asp Gly Met 
325 330 335 



GAA GCT TAT GTC AAA GTA GAC AGC TGT CCA GAG GAA CCC CAA CTA CGA 2070 
Glu Ala Tyr Val Lys Val Asp Ser Cys Pro Glu Glu Pro Gin Leu Arg 
340 345 350 355 



ATG AAA AAT AAT GAA GAA GCG GAA GAC TAT GAT GAT GAT CTT ACT GAT 2118 
Met Lys Asn Asn Glu Glu Ala Glu Asp Tyr Asp Asp Asp Leu Thr Asp 
360 365 370 



TCT GAA ATG GAT GTG GTC AGG TTT GAT GAT GAC AAC TCT CCT TCC TTT 2166 
Ser Glu Met Asp Val Val Arg Phe Asp Asp Asp Asn Ser Pro Ser Phe 
375 380 385 



ATC CAA ATT CGC TCA GTT GCC AAG AAG CAT CCT AAA ACT TGG GTA CAT 2214 

lie Gin lie Arg Ser Val Ala Lys Lys His Pro Lys Thr Trp Val His 
390 395 400 

TAC ATT GCT GCT GAA GAG GAG GAC TGG GAC TAT GCT CCC TTA GTC CTC 22 62 

Tyr lie Ala Ala Glu Glu Glu Asp Trp Asp Tyr Ala Pro Leu Val Leu 

405 410 415 



GCC CCC GAT GAC AGA AGT TAT AAA AGT CAA TAT TTG AAC AAT GGC CCT 2310 
Ala Pro Asp Asp Arg Ser Tyr Lys Ser Gin Tyr Leu Asn Asn Gly Pro 
420 425 430 435 
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CAG CGG ATT 
Gin Arg lie 



GAT GAA ACC 
Asp Glu Thr 



TTG GGA CCT 
Leu Gly Pro 
470 

TTT AAG AAT 
Phe Lys Asn 
485 

ACT GAT GTC 
Thr Asp Val 
500 

CAT TTG AAG 
His Leu Lys 



TGG ACA GTG 
Trp Thr Val 



CTG ACC CGC 
Leu Thr Arg 
550 

TCA GGA CTC 
Ser Gly Leu 
565 

CAA AGA GGA 
Gin Arg Gly 
580 

TCT GTA TTT 
Ser Val Phe 



CGC TTT CTC 
Arg Phe Leu 



CAA GCC TCC 
Gin Ala Ser 
630 

TTG CAG TTG 
Leu Gin Leu 
645 




GGT AGG AAG 
Gly Arg Lys 
440 

TTT AAG ACT 
Phe Lys Thr 
455 

TTA CTT TAT 
Leu Leu Tyr 



CAA GCA AGC 
Gin Ala Ser 



CGT CCT TTG 
Arg Pro Leu 
505 

GAT TTT CCA 
Asp Phe Pro 
520 

ACT GTA GAA 
Thr Val Glu 
535 

TAT TAC TCT 
Tyr Tyr Ser 



ATT GGC CCT 
lie Gly Pro 



AAC CAG ATA 
Asn Gin lie 
585 

GAT GAG AAC 
Asp Glu Asn 
600 

CCC AAT CCA 
Pro Asn Pro 
615 

AAC ATC ATG 
Asn lie Met 



TCA GTT TGT 
Ser Val Cys 



TAC AAA AAA 
Tyr Lys Lys 



CGT GAA GCT 
Arg Glu Ala 
460 

GGG GAA GTT 
Gly Glu Val 
475 

AGA CCA TAT 
Arg Pro Tyr 
490 

TAT TCA AGG 
Tyr Ser Arg 



ATT CTG CCA 
lie Leu Pro 



GAT GGG CCA 
Asp Gly Pro 
540 

AGT TTC GTT 
Ser Phe Val 

555 

CTC CTC ATC 
Leu Leu lie 
570 

ATG TCA GAC 
Met Ser Asp 

CGA AGC TGG 
Arg Ser Trp 



GCT GGA GTG 
Ala Gly Val 
620 

CAC AGC ATC 
His Ser lie 
635 

TTG CAT GAG 
Leu His Glu 
650 



GTC CGA TTT 
Val Arg Phe 
445 

ATT CAG CAT 
lie Gin His 



GGA GAC ACA 
Gly Asp Thr 



AAC ATC TAC 
Asn lie Tyr 
495 

AGA TTA CCA 
Arg Leu Pro 
510 

GGA GAA ATA 
Gly Glu He 
525 

ACT AAA TCA 
Thr Lys Ser 



AAT ATG GAG 
Asn Met Glu 



TGC TAC AAA 
Cys Tyr Lys 
575 

AAG AGG AAT 
Lys Arg Asn 
590 

TAC CTC ACA 
Tyr Leu Thr 
605 

CAG CTT GAG 
Gin Leu Glu 



AAT GGC TAT 
Asn Gly Tyr 



GTG GCA TAC 
Val Ala Tyr 
655 




ATG GCA TAC 
Met Ala Tyr 
450 

GAA TCA GGA 
Glu Ser Gly 
465 

CTG TTG ATT 
Leu Leu He 
480 

CCT CAC GGA 
Pro His Gly 



AAA GGT GTA 
Lys Gly Val 



TTC AAA TAT 
Phe Lys Tyr 
530 

GAT CCT CGG 
Asp Pro Arg 
545 

AGA GAT CTA 
Arg Asp Leu 
560 

GAA TCT GTA 
Glu Ser Val 



GTC ATC CTG 
Val He Leu 



GAG AAT ATA 
Glu Asn He 
610 

GAT CCA GAG 
Asp Pro Glu 
625 

GTT TTT GAT 
Val Phe Asp 
640 

TGG TAC ATT 
Trp Tyr He 



ACA 2358 
Thr 



ATC 2406 
He 



ATA 24 54 
He 



ATC 2502 
He 



AAA 2550 

Lys 

515 

AAA 2598 
Lys 



TGC 2 64 6 
Cys 



GCT 2694 
Ala 



GAT 2742 
Asp 



TTT 2790 

Phe 

595 

CAA 2838 
Gin 



TTC 2886 
Phe 



AGT 2934 
Ser 



CTA 2982 
Leu 
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ACT GAC TTC CTT TCT GTC TTC ^^^T 



AGC ATT GGA GCA CAG ACT GAC TTC CTT TCT GTC TTC TTC TCT GGA TAT 3030 
Ser lie Gly Ala Gin Thr Asp Phe Leu Ser Val Phe Phe Ser Gly Tyr 
660 665 670 675 

ACC TTC AAA CAC AAA ATG GTC TAT GAA GAC ACA CTC ACC CTA TTC CCA 3078 
Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu Thr Leu Phe Pro 
680 685 690 

TTC TCA GGA GAA ACT GTC TTC ATG TCG ATG GAA AAC CCA GGT CTA TGG 312 6 
Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn Pro Gly Leu Trp 
695 700 ' 705 

ATT CTG GGG TGC CAC AAC TCA GAC TTT CGG AAC AGA GGC ATG ACC GCC 317 4 
lie Leu Gly Cys His Asn Ser Asp Phe Arg Asn Arg Gly Met Thr Ala 
710 715 720 

TTA CTG AAG GTT TCT AGT TGT GAC AAG AAC ACT GGT GAT TAT TAC GAG 3222 
Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr Gly Asp Tyr Tyr Glu 
725 730 735 

GAC AGT TAT GAA GAT ATT TCA GCA TAC TTG CTG AGT AAA AAC AAT GCC 327 0 
Asp Ser Tyr Glu Asp lie Ser Ala Tyr Leu Leu Ser Lys Asn Asn Ala 
740 745 750 755 

ATT GAA CCA AGA AGC TTC TCC CAG GTAAGTTATT ATATAAATTC AAGACACCCT 3324 
lie Glu Pro Arg Ser Phe Ser Gin 
760 

AGCACTAGGC AAAAGCAATT TAATGCCACC AC AAT TCC AG AAAATGACAT AG AG AAG AC T 3384 

GACCCTTGGT TTGCACACAG AACACCTATG CCTAAAATAC AAAATGTCTC CTCTAGTGAT 34 4 4 

TTGTTGATGC TCTTGCGACA GAGTCCTACT CCACATGGGC TATCCTTATC TGATCTCCAA 3504 

GAAGCCAAAT ATGAGACTTT TTCTGATGAT CCATCACCTG GAGCAATAGA CAGTAATAAC 3564 

AGCCTGTCTG AAATGACACA CTTCAGGCCA CAGCTCCATC ACAGTGGGGA CATGGTATTT 362 4 

ACCCCTGAGT CAGGCCTCCA ATTAAGATTA AATGAGAAAC TG GGG AC AAC TGCAGCAACA 368 4 

GAGTTGAAGA AACTTGATTT CAAAGTTTCT AG TAC AT C AA ATAATCTGAT T T C AAC AAT T 37 4 4 

C CAT CAG AC A ATTTGGCAGC AGGTACTGAT AATACAAGTT CCTTAGGACC CCCAAGTATG 3804 

CCAGTTCATT AT GAT AG T C A AT TAG AT AC C ACTCTATTTG GCAAAAAGTC ATCTCCCCTT 38 64 

ACTGAGTCTG GTGGACCTCT GAGCTTGAGT GAAGAAAATA ATGATTCAAA GTTGTTAGAA 3924 

TCAGGTTTAA TGAATAGCCA AGAAAGTTCA TGGGGAAAAA ATGTATCGTC AAC AG AG AG T 398 4 

GGTAGGTTAT TTAAAGGGAA AAG AGC T CAT GGACCTGCTT TGTTGACTAA AGATAATGCC 4 04 4 

TTATTCAAAG TTAGCATCTC TTTGTTAAAG ACAAACAAAA CTTCCAATAA TTCAGCAACT 4104 

AATAGAAAGA CTCACATTGA TGGCCCATCA TTATTAATTG AGAATAGTCC ATCAGTCTGG 4164 

CAAAATATAT T AG AAAGT G A CACTGAGTTT AAAAAAGTGA CACCTTTGAT TCATGACAGA 4 22 4 
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ATGCTTATGG ACAAAAATGC TACAGCTTTG AGGCTAAATC ATATGTCAAA TAAAACTACT 4 28 4 

TCATCAAAAA ACATGGAAAT GGTCCAACAG AAAAAAGAGG GCCCCATTCC ACCAGATGCA 4 34 4 

CAAAATCCAG ATATGTCGTT CTTTAAGATG CTATTCTTGC CAGAATCAGC AAGGTGGATA 4 404 

CAAAGGACTC ATGGAAAGAA CTCTCTGAAC TCTGGGCAAG GCCCCAGTCC AAAGCAATTA 4 4 64 

GTATCCTTAG G AC C AG AAAA ATCTGTGGAA GGTCAGAATT TCTTGTCTGA GAAAAACAAA 4 524 

GTGGTAGTAG GAAAGGGTGA ATT T AC AAAG GACGTAGGAC TCAAAGAGAT GGTTTTTCCA 4 58 4 

AGCAGCAGAA ACCTATTTCT TACTAACTTG GATAATTTAC ATGAAAATAA TACACACAAT 4 64 4 

CAAGAAAAAA AAATTCAGGA AGAAATAGAA AAG AAG G AAA CATTAATCCA AG AG AAT G T A 4 704 

GTTTTGCCTC AGATACATAC AGTGACTGGC ACTAAGAATT TCATGAAGAA CCTTTTCTTA 4 7 64 

CTGAGCACTA GGCAAAATGT AGAAGGTTCA TATGAGGGGG CATATGCTCC AGTACTTCAA 4 82 4 

GATTTTAGGT CATTAAATGA TTCAACAAAT AGAACAAAGA AACACACAGC TCATTTCTCA 4 88 4 

AAAAAAGGGG AGGAAGAAAA CTTGGAAGGC TTGGGAAATC AAACCAAGCA AATTGTAGAG 4 94 4 

AAAT AT G CAT G C AC C AC AAG GATATCTCCT AATACAAGCC AGCAGAATTT TGTCACGCAA 5004 

CGTAGTAAGA GAGCTTTGAA ACAATTCAGA CTCCCACTAG AAGAAACAGA ACTTGAAAAA 50 64 

AGGATAATTG TGGATGACAC CTCAACCCAG TGGTCCAAAA ACATGAAACA TTTGACCCCG 512 4 

AGCACCCTCA CACAGATAGA CTACAATGAG AAG GAG AAAG GGGCCATTAC TCAGTCTCCC 518 4 

T TAT C AG ATT GCCTTACGAG GAGTCATAGC ATCCCTCAAG CAAATAGATC TCCATTACCC 524 4 

ATTGCAAAGG TATCATCATT TCCATCTATT AGACCTATAT ATCTGACCAG GGTCCTATTC 5304 

CAAGACAACT CTTCTCATCT TCCAGCAGCA TCTTATAGAA AG AAAG AT T C TGGGGTCCAA 53 64 

GAAAGCAGTC ATTTCTTACA AGGAGCCAAA AAAAATAACC TTTCTTTAGC CATTCTAACC 54 2 4 

TTGGAGATGA CTGGTGATCA AAG AG AG G T T GGCTCCCTGG GGACAAGTGC C AC AAAT T C A 54 8 4 

GTCACATACA AG AAAG T T G A GAACACTGTT CTCCCGAAAC CAGACTTGCC CAAAACATCT 554 4 

GGCAAAGTTG AATTGCTTCC AAAAGTTCAC ATTTATCAGA AGGACCTATT CCCTACGGAA 5 604 

ACTAGCAATG GGTCTCCTGG CCATCTGGAT CTCGTGGAAG GGAGCCTTCT TCAGGGAACA 5664 

GAGGGAGCGA TTAAGTGGAA TGAAGCAAAC AGACCTGGAA AAGTTCCCTT TCTGAGAGTA 5724 

GCAACAGAAA GCTCTGCAAA GACTCCCTCC AAGCTATTGG ATCCTCTTGC TTGGGATAAC 578 4 

CACTATGGTA CTCAGATACC AAAAGAAGAG TGGAAATCCC AAG AG AAG T C ACCAGAAAAA 584 4 

ACAGCTTTTA AGAAAAAGGA TAGCATTTTG TCCCTGAACG CTTGTGAAAG CAATCATGCA 5904 

ATAGCAGCAA TAAATGAGGG ACAAAATAAG CCCGAAATAG AAGTCACCTG GGCAAAGCAA 5964 
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GGTAGGACTG AAAGGCTGTG CTCTCAATTG TGCTAATAAA GCTTGGCAAG AGTATTTCAA 6024 

GGAAGATGAA GTCATTAACT ATGCAAAATG CTTCTCAGGC ACCTAGGAAA ATGAGGATGT 608 4 

GAGGCATTTC TACCCACTTG GTACATAAAA TTATTGGGTC ACCCTTTTCC TCTTCTTTTT 614 4 

TTCTCCAG AAC CCA CCA GTC TTG AAA CGC CAT CAA CGG GAA ATA ACT CGT 6194 
Asn Pro Pro Val Leu Lys Arg His Gin Arg Glu lie Thr Arg 
1 5 10 

ACT ACT CTT CAG TCA GAT CAA GAG GAA ATT GAC TAT GAT GAT ACC ATA 62 4 2 

Thr Thr Leu Gin Ser Asp Gin Glu Glu lie Asp Tyr Asp Asp Thr lie 
15 20 25 30 

TCA GTT GAA ATG AAG AAG GAA GAT TTT GAC ATT TAT GAT GAG GAT GAA 62 90 

Ser Val Glu Met Lys Lys Glu Asp Phe Asp lie Tyr Asp Glu Asp Glu 
35 40 45 

AAT CAG AGC CCC CGC AGC TTT CAA AAG AAA ACA CGA CAC TAT TTT ATT 6338 
Asn Gin Ser Pro Arg Ser Phe Gin Lys Lys Thr Arg His Tyr Phe lie 
50 55 60 

GCT GCA GTG GAG AGG CTC TGG GAT TAT GGG ATG AGT AGC TCC CCA CAT 638 6 

Ala Ala Val Glu Arg Leu Trp Asp Tyr Gly Met Ser Ser Ser Pro His 
65 70 75 

GTT CTA AGA AAC AGG GCT CAG AGT GGC AGT GTC CCT CAG TTC AAG AAA 64 34 

Val Leu Arg Asn Arg Ala Gin Ser Gly Ser Val Pro Gin Phe Lys Lys 
80 85 90 

GTT GTT TTC CAG GAA TTT ACT GAT GGC TCC TTT ACT CAG CCC TTA TAC 64 82 

Val Val Phe Gin Glu Phe Thr Asp Gly Ser Phe Thr Gin Pro Leu Tyr 
95 100 105 110 

CGT GGA GAA CTA AAT GAA CAT TTG GGA CTC CTG GGG CCA TAT ATA AGA 6530 
Arg Gly Glu Leu Asn Glu His Leu Gly Leu Leu Gly Pro Tyr lie Arg 
115 120 125 

GCA GAA GTT GAA GAT AAT ATC ATG GTA ACT TTC AGA AAT CAG GCC TCT 657 8 

Ala Glu Val Glu Asp Asn lie Met Val Thr Phe Arg Asn Gin Ala Ser 
130 135 140 

CGT CCC TAT TCC TTC TAT TCT AGC CTT ATT TCT TAT GAG GAA GAT CAG 662 6 

Arg Pro Tyr Ser Phe Tyr Ser Ser Leu lie Ser Tyr Glu Glu Asp Gin 
145 150 155 

AGG CAA GGA GCA GAA CCT AGA AAA AAC TTT GTC AAG CCT AAT GAA ACC 667 4 

Arg Gin Gly Ala Glu Pro Arg Lys Asn Phe Val Lys Pro Asn Glu Thr 
160 165 170 

AAA ACT TAC TTT TGG AAA GTG CAA CAT CAT ATG GCA CCC ACT AAA GAT 6722 
Lys Thr Tyr Phe Trp Lys Val Gin His His Met Ala Pro Thr Lys Asp 
175 180 185 190 

GAG TTT GAC TGC AAA GCC TGG GCT TAT TTC TCT GAT GTT GAC CTG GAA 677 0 

Glu Phe Asp Cys Lys Ala Trp Ala Tyr Phe Ser Asp Val Asp Leu Glu 
195 200 205 
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AAA GAT GTG CAC TCA GGC CTG ATT GGA CCC CTT CTG GTC TGC CAC ACT 6818 
Lys Asp Val His Ser Gly Leu lie Gly Pro Leu Leu Val Cys His Thr 
210 215 220 

AAC ACA CTG AAC CCT GCT CAT GGG AGA CAA GTG ACA GTA CAG GAA TTT 68 66 

Asn Thr Leu'Asn Pro Ala His Gly Arg Gin Val Thr Val Gin Glu Phe 
225 230 235 

GCT CTG TTT TTC ACC ATC TTT GAT GAG ACC AAA AGC TGG TAC TTC ACT 6914 
Ala Leu Phe Phe Thr lie Phe Asp Glu Thr Lys Ser Trp Tyr Phe Thr 
240 245 250 

GAA AAT ATG GAA AGA AAC TGC AGG GCT CCC TGC AAT ATC CAG ATG GAA 6962 
Glu Asn Met Glu Arg Asn Cys Arg Ala Pro Cys Asn lie Gin Met Glu 
255 260 265 270 

GAT CCC ACT TTT AAA GAG AAT TAT CGC TTC CAT GCA ATC AAT GGC TAC 7010 
Asp Pro Thr Phe Lys Glu Asn Tyr Arg Phe His Ala lie Asn Gly Tyr 
275 280 285 

ATA ATG GAT ACA CTA CCT GGC TTA GTA ATG GCT CAG GAT CAA AGG ATT 7 058 

lie Met Asp Thr Leu Pro Gly Leu Val Met Ala Gin Asp Gin Arg lie 
290 295 300 

CGA TGG TAT CTG CTC AGC ATG GGC AGC AAT GAA AAC ATC CAT TCT ATT 710 6 

Arg Trp Tyr Leu Leu Ser Met Gly Ser Asn Glu Asn lie His Ser lie 
305 310 315 

CAT TTC AGT GGA CAT GTG TTC ACT GTA CGA AAA AAA GAG GAG TAT AAA 715 4 

His Phe Ser Gly His Val Phe Thr Val Arg Lys Lys Glu Glu Tyr Lys 
320 325 330 

ATG GCA CTG TAC AAT CTC TAT CCA GGT GTT TTT GAG ACA GTG GAA ATG 7202 
Met Ala Leu Tyr Asn Leu Tyr Pro Gly Val Phe Glu Thr Val Glu Met 
335 340 345 350 

TTA CCA TCC AAA GCT GGA ATT TGG CGG GTG GAA TGC CTT ATT GGC GAG 7 250 

Leu Pro Ser Lys Ala Gly lie Trp Arg Val Glu Cys Leu lie Gly Glu 
355 360 365 

CAT CTA CAT GCT GGG ATG AGC ACA CTT TTT CTG GTG TAC AGC AAT AAG 72 98 

His Leu His Ala Gly Met Ser Thr Leu Phe Leu Val Tyr Ser Asn Lys 
370 375 380 

TGT CAG ACT CCC CTG GGA ATG GCT TCT GGA CAC ATT AGA GAT TTT CAG 7 34 6 

Cys Gin Thr Pro Leu Gly Met Ala Ser Gly His lie Arg Asp Phe Gin 
385 390 395 

ATT ACA GCT TCA GGA CAA TAT GGA CAG TGG GCC CCA AAG CTG GCC AGA 7 394 
lie Thr Ala Ser Gly Gin Tyr Gly Gin Trp Ala Pro Lys Leu Ala Arg 
400 405 410 

CTT CAT TAT TCC GGA TCA ATC AAT GCC TGG AGC ACC AAG GAG CCC TTT 74 4 2 

Leu His Tyr Ser Gly. Ser lie Asn Ala Trp Ser Thr Lys Glu Pro Phe 
415 420 425 430 
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TCT TGG ATC AAG GTG GAT CTG TTG GCA CCA ATG ATT ATT CAC GGC ATC 74 90 

Ser Trp He Lys Val Asp Leu Leu Ala Pro Met He lie His Gly He 
435 440 445 

AAG ACC CAG GGT GCC CGT CAG AAG TTC TCC AGC CTC TAC ATC TCT CAG 7 538 

Lys Thr Gin Gly Ala Arg Gin Lys Phe Ser Ser Leu Tyr He Ser Gin 
450 455 460 

TTT ATC ATC ATG TAT AGT CTT GAT GGG AAG AAG TGG CAG ACT TAT CGA 7 58 6 
Phe He He Met Tyr Ser Leu Asp Gly Lys Lys Trp Gin Thr Tyr Arg 
465 470 475 

GGA AAT TCC ACT GGA ACC TTA'ATG GTC TTC TTT GGC AAT GTG GAT TCA 7634 
Gly Asn Ser Thr Gly Thr Leu Met Val Phe Phe Gly Asn Val Asp Ser 
480 485 490 

TCT GGG ATA AAA CAC AAT ATT TTT AAC CCT CCA ATT ATT GCT CGA TAC 7 682 

Ser Gly He Lys His Asn He Phe Asn Pro Pro He He Ala Arg Tyr 
495 500 505 510 

ATC CGT TTG CAC CCA ACT CAT TAT AGC ATT CGC AGC ACT CTT CGC ATG 7 7 30 

He Arg Leu His Pro Thr His Tyr Ser He Arg Ser Thr Leu Arg Met 
515 520 525 

GAG TTG ATG GGC TGT GAT TTA AAT AGT TGC AGC ATG CCA TTG GGA ATG 7 77 8 

Glu Leu Met Gly Cys Asp Leu Asn Ser Cys Ser Met Pro Leu Gly Met 
530 535 540 

GAG AGT AAA GCA ATA TCA GAT GCA CAG ATT ACT GCT TCA TCC TAC TTT 782 6 

Glu Ser Lys Ala He Ser Asp Ala Gin He Thr Ala Ser Ser Tyr Phe 
545 550 555 

ACC AAT ATG TTT GCC ACC TGG TCT CCT TCA AAA GCT CGA CTT CAC CTC 7 874 

Thr Asn Met Phe Ala Thr Trp Ser Pro Ser Lys Ala Arg Leu His Leu 
560 565 570 

CAA GGG AGG AGT AAT GCC TGG AGA CCT CAG GTG AAT AAT CCA AAA GAG 7 922 

Gin Gly Arg Ser Asn Ala Trp Arg Pro Gin Val Asn Asn Pro Lys Glu 
575 580 585 590 

TGG CTG CAA GTG GAC TTC CAG AAG ACA ATG AAA GTC ACA GGA GTA ACT 7 97 0 

Trp Leu Gin Val Asp Phe Gin Lys Thr Met Lys Val Thr Gly Val Thr 
595 600 605 

ACT CAG GGA GTA AAA TCT CTG CTT ACC AGC ATG TAT GTG AAG GAG TTC 8018 
Thr Gin Gly Val Lys Ser Leu Leu Thr Ser Met Tyr Val Lys Glu Phe 
610 615 620 

CTC ATC TCC AGC AGT CAA GAT GGC CAT CAG TGG ACT CTC TTT TTT CAG 8 0 66 

Leu He Ser Ser Ser Gin Asp Gly His Gin Trp Thr Leu Phe Phe Gin 
625 630 635 

AAT GGC AAA GTA AAG GTT TTT CAG GGA AAT CAA GAC TCC TTC ACA CCT 8114 
Asn Gly Lys Val Lys Val Phe Gin Gly Asn Gin Asp Ser Phe Thr Pro 
640 645 650 
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GTG GTG AAC TCT ttA GAC CCA CCG TTA CTG ACT CGC TAC CTT CGA ATT 
Val Val Asn Ser Leu Asp Pro Pro Leu Leu Thr Arg Tyr Leu Arg lie 
655 660 665 670 





8162 



CAC CCC CAG AGT TGG GTG CAC CAG ATT GCC CTG AGG ATG GAG GTT CTG 
His Pro Gin Ser Trp Val His Gin lie Ala Leu Arg Met Glu Val Leu 
675 680 685 



8210 



GGC TGC GAG GCA CAG GAC CTC TAC TGAGGGTGGC CACTGCAGCA CCTGCCACTG 82 64 
Gly Cys Glu Ala Gin Asp Leu Tyr 



CCGTCACCTC TCCCTCCTCA GCTCCAGGGC AGTGTCCCTC CCTGGCTTGC CTTCTACCTT 8 32 4 

TGTGCTAAAT CCTAGCAGAC ACTGCCTTGA AGCCTCCTGA ATTAACTATC ATCAGTCCTG 8 38 4 

CATTTCTTTG GTGGGGGGCC AGGAGGGTGC ATCCAATTTA ACTTAACTCT TACCTATTTT 84 4 4 

CTGCAGCTGC TCCCAGATTA CTCCTTCCTT CCAATATAAC TAGGCAAAAA GAAGTGAGGA 8 50 4 

GAAACCTGCA TGAAAGCATT CTTCCCTGAA AAGTTAGGCC TCTCAGAGTC ACCACTTCCT 85 64 

CTGTTGTAGA AAAACTATGT GATGAAACTT TGAAAAAGAT ATTTATGATG TTAACTTGTT 8 62 4 

TATTGCAGCT TATAATGGTT ACAAATAAAG CAATAGCATC ACAAATTTCA CAAATAAAGC 8 68 4 

ATTTTTTTCA CTGCATTCTA GTTGTGGTTT GTCCAAACTC ATCAATGTAT CTTATCATGT 874 4 

CTGGATCCCC GGGTGGCATC CCTGTGACCC CTCCCCAGTG CCTCTCCTGG CCCTGGAAGT 8804 

TGCCACTCCA GTGCCCACCA GCCTTGTCCT AATAAAATTA AGTTGCATCA TTTTGTCTGA 8 8 64 

CTAGGTGTCC TTCTATAATA TTATGGGGTG GAGGGGGGTG GTATGGAGCA AGGGGCAAGT 8 92 4 

TGGGAAGACA ACCTGTAGGG CCTGCGGGGT CTATTCGGGA ACCAAGCTGG AGTGCAGTGG 8 98 4 

CACAATCTTG GCTCACTGCA ATCTCCGCCT CCTGGGTTCA AGCGATTCTC CTGCCTCAGC 904 4 

CTCCCGAGTT GTTGGGATTC CAG G CAT GCA TGACCAGGCT CAGCTAATTT TTGTTTTTTT 9104 

GGTAGAGACG GGGTTTCACC ATATTGGCCA GGCTGGTCTC CAACTCCTAA TCTCAGGTGA 9164 

TCTACCCACC TTGGCCTCCC AAATTGCTGG GATTACAGGC GTGAACCACT GCTCCCTTCC 922 4 

CTGTCCTTCT GATTTTAAAA TAACTATACC AGCAGGAGGA CGTCCAGACA CAGCATAGGC 92 8 4 

TACCTGCCAT GCCCAACCGG TGGGACATTT GAGTTGCTTG CTTGGCACTG TCCTCTCATG 934 4 

CGTTGGGTCC ACTCAGTAGA TGCCTGTTGA ATTCGTAATC ATGGTCATAG CTGTTTCCTG 94 04 

TGTGAAATTG TTATCCGCTC ACAATTCCAC ACAACATACG AGCCGGAAGC ATAAAGTGTA 94 64 

AAGCCTGGGG TGCCTAATGA GTGAGCTAAC TCACATTAAT TGCGTTGCGC TCACTGCCCG 9524 

CTTTCCAGTC GGGAAACCTG TCGTGCCAGC TGCATTAATG AATCGGCCAA CGCGCGGGGA 958 4 

GAGGCGGTTT GCGTATTGGG CGCTCTTCCG CTTCCTCGCT CACTGACTCG CTGCGCTCGG 964 4 



690 
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TCGTTCGGCT GCGGCGAGCG GTATCAGCTC ACTCAAAGGC GGTAATACGG TTATCCACAG 9704 
AATCAGGGGA TAACGCAGGA AAGAACATGT GAGCAAAAGG CCAGCAAAAG GCCAGGAACC 97 64 
GTAAAAAGGC CGCGTTGCTG GCGTTTTTCC ATAGGCTCCG CCCCCCTGAC GAGCATCACA 982 4 
AAAATCGACG CTCAAGTCAG AGGTGGCGAA ACCCGACAGG ACTATAAAGA TACCAGGCGT 9884 
TTCCCCCTGG AAGCTCCCTC GTGCGCTCTC CTGTTCCGAC CCTGCCGCTT ACCGGATACC 9 94 4 
TGTCCGCCTT TCTCCCTTCG GGAAGCGTGG CGCTTTCTCA TAGCTCACGC TGTAGGTATC 10004 
TCAGTTCGGT GTAGGTCGTT CGCTCCAAGC TGGGCTGTGT GCACGAACCC CCCGTTCAGC 100 64 
CCGACCGCTG CGCCTTATCC GGTAACTATC GTCTTGAGTC CAACCCGGTA AGACACGACT 1012 4 
TATCGCCACT GGCAGCAGCC ACTGGTAACA GGATTAGCAG AGCGAGGTAT GTAGGCGGTG 10184 
CTACAGAGTT CTTGAAGTGG TGGCCTAACT ACGGCTACAC TAGAAGGACA GTATTTGGTA 1024 4 
TCTGCGCTCT GCTGAAGCCA GTTACCTTCG GAAAAAGAGT TGGTAGCTCT TGATCCGGCA 10304 
AACAAACCAC CGCTGGTAGC GGTGGTTTTT TTGTTTGCAA GCAGCAGATT ACGCGCAGAA 103 64 
AAAAAGGATC TCAAGAAGAT CCTTTGATCT TTTCTACGGG GTCTGACGCT CAGTGGAACG 104 24 
AAAACTCACG TTAAGGGATT TTGGTCATGA GAT TAT C AAA AAGGATCTTC ACCTAGATCC 104 8 4 
TTTTAAATTA AAAATGAAGT TTTAAATCAA TCTAAAGTAT ATATGAGTAA ACTTGGTCTG 10544 
ACAGTTACCA ATGCTTAATC AGTGAGGCAC CTATCTCAGC GATCTGTCTA TTTCGTTCAT 10 604 
CCATAGTTGC CTGACTCCCC GTCGTGTAGA TAACTACGAT ACGGGAGGGC TTACCATCTG 10664 
GCCCCAGTGC TGCAATGATA CCGCGAGACC CACGCTCACC GGCTCCAGAT TTATCAGCAA 10724 
TAAACCAGCC AGCCGGAAGG GCCGAGCGCA GAAGTGGTCC TGCAACTTTA TCCGCCTCCA 10784 
TCCAGTCTAT TAATTGTTGC CGGGAAGCTA GAGTAAGTAG TTCGCCAGTT AATAGTTTGC 10844 
GCAACGTTGT TGCCATTGCT ACAGGCATCG TGGTGTCACG CTCGTCGTTT GGTATGGCTT 10 904 
CATTCAGCTC CGGTTCCCAA CGATCAAGGC GAGTTACATG ATCCCCCATG TTGTGCAAAA 10964 
AAGCGGTTAG CTCCTTCGGT CCTCCGATCG TTGTCAGAAG TAAGTTGGCC GCAGTGTTAT 11024 
CACTCATGGT TAT G G C AG C A CTGCATAATT CTCTTACTGT CATGCCATCC GTAAGATGCT 11084 
TTTCTGTGAC TGGTGAGTAC TCAACCAAGT CATTCTGAGA ATAGTGTATG CGGCGACCGA 1114 4 
GTTGCTCTTG CCCGGCGTCA ATACGGGATA ATACCGCGCC ACATAGCAGA ACTTTAAAAG 11204 
TGCTCATCAT TGGAAAACGT TCTTCGGGGC GAAAACTCTC AAGGATCTTA CCGCTGTTGA 112 64 
GATCCAGTTC GATGTAACCC ACTGGTGCAC CCAACTGATC TTCAGCATCT TTTACTTTCA 11324 
CCAGCGTTTC TGGGTGAGCA AAAACAGGAA GGCAAAATGC CGCAAAAAAG GGAATAAGGG 11384 
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CGACACGGAA ATGTTGAATA CTCATACTCT TCCTTTTTCA AT AT TAT T G A AGCATTTATC 114 4 4 
AGGGTTATTG TCTCATGAGC GGATACATAT TTGAATGTAT T T AG AAAAAT AAACAAATAG 11504 
GGGTTCCGCG CACATTTCCC CGAAAAGTGC CACCTGACGT CTAAGAAACC ATTATTATCA 115 64 
TGACATTAAC CTATAAAAAT AGGCGTATCA CGAGGCCCTT TCGTCTCGCG CGTTTCGGTG 11624 
ATGACGGTGA AAACCTCTGA CACATGCAGC TCCCGGAGAC GGTCACAGCT TGTCTGTAAG 11684 
CGGATGCCGG GAGCAGACAA GCCCGTCAGG GCGCGTCAGC GGGTGTTGGC GGGTGTCGGG 1174 4 
GCTGGCTTAA CTATGCGGCA TCAGAGCAGA TTGTACTGAG AGTGCACCAT ATGCGGTGTG 11804 
AAATACCGCA CAGATGCGTA AGGAGAAAAT ACCGCATCAG GCGCCATTCG CCATTCAGGC 118 64 
TGCGCAACTG TTGGGAAGGG CGATCGGTGC GGGCCTCTTC GCTATTACGC CAGCTGGCGA 11924 
AAGGGGGATG TGCTGCAAGG CGATTAAGTT GGGTAACGCC AGGGTTTTCC CAGTCACGAC 11984 
GTTGTAAAAC GACGGCCAGT GCCAAGCTTG GGCTGCAG 12022 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11846 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1006.. 8058 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

GTCGACGGTA TCGATAAGCT TGATATCGAA TTCCTGCAGC CCGGGGGATC CACTAGTACT 60 

C GAG AC C TAG GAGTTAATTT TTAAAAAGCA GTCAAAAGTC CAAGTGGCCC TTGCGAGCAT 120 

TTACTCTCTC TGTTTGCTCT GGTTAATAAT CTCAGGAGCA CAAACATTCC TTACTAGTCC 18 0 

TAGAAGTTAA TTTTTAAAAA GCAGTCAAAA GTCCAAGTGG CCCTTGCGAG CATTTACTCT 24 0 

CTCTGTTTGC TCTGGTTAAT AATCTCAGGA GCACAAACAT TCCTTACTAG TTCTAGAGCG 300 

GCCGCCAGTG TGCTGGAATT CGGCTTTTTT AGGGCTGGAA GCTACCTTTG ACATCATTTC 360 

CTCTGCGAAT GCATGTATAA TTTCTACAGA ACCTATTAGA AAGGATCACC CAGCCTCTGC 4 20 

TTTTGTACAA CTTTCCCTTA AAAAACTGCC AATTCCACTG CTGTTTGGCC CAATAGTGAG 4 80 

AACTTTTTCC TGCTGCCTCT* TGGTGCTTTT GCCTATGGCC CCTATTCTGC CTGCTGAAGA 54 0 

CACTCTTGCC AGCATGGACT TAAACCCCTC CAGCTCTGAC AATCCTCTTT CTCTTTTGTT 600 
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C^^^AG CCAAAGCAAT CACTCAAAGT TCA^^^T 



TTACATGAAG GGTCTGGCAG CCAAAGCAAT CACTCAAAGT TCAAACCTTA TCATTTTTTG 660 

CTTTGTTCCT CTTGGCCTTG GTTTTGTACA TCAGCTTTGA AAATACCATC CCAGGGTTAA 7 20 

TGCTGGGGTT AATTTATAAC TAAGAGTGCT CTAGTTTTGC AAT AC AG G AC ATGCTATAAA 7 80 

AATGGAAAGA TGTTGCTTTC TGAGAGATCT CGAGGAAGCT AACAACAAAG AACAACAAAC 84 0 

AACAATCAGG TAAGTATCCT TTTTACAGCA CAACTTAATG AGACAGATAG AAACTGGTCT 900 

TGTAGAAACA GAGTAGTCGC CTGCTTTTCT GCCAGGTGCT GACTTCTCTC CCCTTCTCTT 960 

TTTTCCTTTT CTCAGGATAA CAAGAACGAA ACAATAACAG CCACC ATG GAA ATA 1014 

Met Glu lie 
1 

GAG CTC TCC ACC TGC TTC TTT CTG TGC CTT TTG CGA TTC TGC TTT AGT 10 62 

Glu Leu Ser Thr Cys Phe Phe Leu Cys Leu Leu Arg Phe Cys Phe Ser 
5 10 15 

GCC ACC AGA AGA TAC TAC CTG GGT GCA GTG GAA CTG TCA TGG GAC TAT 1110 
Ala Thr Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu Ser Trp Asp Tyr 
20 25 30 35 

ATG CAA AGT GAT CTC GGT GAG CTG CCT GTG GAC GCA AGA TTT CCT CCT 1158 
Met Gin Ser Asp Leu Gly Glu Leu Pro Val Asp Ala Arg Phe Pro Pro 
40 45 50 

AGA GTG CCA AAA TCT TTT CCA TTC AAC ACC TCA GTC GTG TAC AAA AAG 120 6 

Arg Val Pro Lys Ser Phe Pro Phe Asn Thr Ser Val Val Tyr Lys Lys 
55 60 65 

ACT CTG TTT GTA GAA TTC ACG GTT CAC CTT TTC AAC ATC GCT AAG CCA 12 54 

Thr Leu Phe Val Glu Phe Thr Val His Leu Phe Asn lie Ala Lys Pro 
70 75 ' 80 

AGG CCA CCC TGG ATG GGT CTG CTA GGT CCT ACC ATC CAG GCT GAG GTT 1302 
Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr lie Gin Ala Glu Val 
85 90 95 

TAT GAT ACA GTG GTC ATT ACA CTT AAG AAC ATG GCT TCC CAT CCT GTC 1350 
Tyr Asp Thr Val Val lie Thr Leu Lys Asn Met Ala Ser His Pro Val 
100 105 110 115 

AGT CTT CAT GCT GTT GGT GTA TCC TAC TGG AAA GCT TCT GAG GGA GCT 1398 
Ser Leu His Ala Val Gly Val Ser Tyr Trp Lys Ala Ser Glu Gly Ala 
120 125 130 

GAA TAT GAT GAT CAG ACC AGT CAA AGG GAG AAA GAA GAT GAT AAA GTC 14 4 6 

Glu Tyr Asp Asp Gin Thr Ser Gin Arg Glu Lys Glu Asp Asp Lys Val 
135 140 145 

TTC CCT GGT GGA AGC CAT ACA TAT GTC TGG CAG GTC CTG AAA GAG AAT 14 94 

Phe Pro Gly Gly Ser His Thr Tyr Val Trp Gin Val Leu Lys Glu Asn 
150 155 160 
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GGT CCA ATG GCC TCT GAC CCA CTG TGC CTT ACC TAC TCA TAT CTT TCT 154 2 

Gly Pro Met Ala Ser Asp Pro Leu Cys Leu Thr Tyr Ser Tyr Leu Ser 
165 170 175 

CAT GTG GAC CTG GTA AAA GAC TTG AAT TCA GGC CTC ATT GGA GCC CTA 1590 
His Val Asp Leu Val Lys Asp Leu Asn Ser Gly Leu lie Gly Ala Leu 
180 185 190 195 

CTA GTA TGT AGA GAA GGG AGT CTG GCC AAG GAA AAG ACA CAG ACC TTG 1638 
Leu Val Cys Arg Glu Gly Ser Leu Ala Lys Glu Lys Thr Gin Thr Leu 
200 205 210 

CAC AAA TTT ATA CTA CTT TTT GCT GTA TTT GAT GAA GGG AAA AGT TGG 168 6 

His Lys Phe lie Leu Leu Phe Ala Val Phe Asp Glu Gly Lys Ser Trp 
215 220 225 

CAC TCA GAA ACA AAG AAC TCC TTG ATG CAG GAT AGG GAT GCT GCA TCT 17 34 

His Ser Glu Thr Lys Asn Ser Leu Met Gin Asp Arg Asp Ala Ala Ser 
230 235 240 

GCT CGG GCC TGG CCT AAA ATG CAC ACA GTC AAT GGT TAT GTA AAC AGG 1782 
Ala Arg Ala Trp Pro Lys Met His Thr Val Asn Gly Tyr Val Asn Arg 
245 250 255 

TCT CTG CCA GGT CTG ATT GGA TGC CAC AGG AAA TCA GTC TAT TGG CAT 18 30 

Ser Leu Pro Gly Leu lie Gly Cys His Arg Lys Ser Val Tyr Trp His 
260 265 270 275 

GTG ATT GGA ATG GGC ACC ACT CCT GAA GTG CAC TCA ATA TTC CTC GAA 18 7 8 

Val lie Gly Met Gly Thr Thr Pro Glu Val His Ser lie Phe Leu Glu 
280 285 290 

GGT CAC ACA TTT CTT GTG AGG AAC CAT CGC CAG GCG TCC TTG GAA ATC 192 6 

Gly His Thr Phe Leu Val Arg Asn His Arg Gin Ala Ser Leu Glu lie 
295 300 305 

TCG CCA ATA ACT TTC CTT ACT GCT CAA ACA CTC TTG ATG GAC CTT GGA 197 4 

Ser Pro lie Thr Phe Leu Thr Ala Gin Thr Leu Leu Met Asp Leu Gly 
310 315 320 

CAG TTT CTA CTG TTT TGT CAT ATC TCT TCC CAC CAA CAT GAT GGC ATG 2022 
Gin Phe Leu Leu Phe Cys His lie Ser Ser His Gin His Asp Gly Met 
325 330 335 

GAA GCT TAT GTC AAA GTA GAC AGC TGT CCA GAG GAA CCC CAA CTA CGA 207 0 

Glu Ala Tyr Val Lys Val Asp Ser Cys Pro Glu Glu Pro Gin Leu Arg 
340 345 350 355 

ATG AAA AAT AAT GAA GAA GCG GAA GAC TAT GAT GAT GAT CTT ACT GAT 2118 
Met Lys Asn Asn Glu Glu Ala Glu Asp Tyr Asp Asp Asp Leu Thr Asp 
360 365 370 

TCT GAA ATG GAT GTG GTC AGG TTT GAT GAT GAC AAC TCT CCT TCC TTT 2166 
Ser Glu Met Asp Val Val Arg Phe Asp Asp Asp Asn Ser Pro Ser Phe 
375 380 385 
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ATC CAA ATT CGC TCA GTT GCC AAG AAG CAT CCT AAA ACT TGG GTA CAT 2214 
lie Gin lie Arg Ser Val Ala Lys Lys His Pro Lys Thr Trp Val His 
390 395 400 

TAC ATT GCT GCT GAA GAG GAG GAC TGG GAC TAT GCT CCC TTA GTC CTC 22 62 
Tyr lie Ala Ala Glu Glu Glu Asp Trp Asp Tyr Ala Pro Leu Val Leu 
405 410 415 

GCC CCC GAT GAC AGA AGT TAT AAA AGT CAA TAT TTG AAC AAT GGC CCT 2310 
Ala Pro Asp Asp Arg Ser Tyr Lys Ser Gin Tyr Leu Asn Asn Gly Pro 
420 425 430 435 

CAG CGG ATT GGT AGG AAG TAC AAA AAA GTC CGA TTT ATG GCA TAC ACA 2 358 
Gin Arg lie Gly Arg Lys Tyr Lys Lys Val Arg Phe Met Ala Tyr Thr 
440 445 450 

GAT GAA ACC TTT AAG ACT CGT GAA GCT ATT CAG CAT GAA TCA GGA ATC 2 4 06 

Asp Glu Thr Phe Lys Thr Arg Glu Ala lie Gin His Glu Ser Gly lie 
455 460 465 

TTG GGA CCT TTA CTT TAT GGG GAA GTT GGA GAC ACA CTG TTG ATT ATA 2 4 54 
Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Leu Leu lie lie 
470 475 480 

TTT AAG AAT CAA GCA AGC AGA CCA TAT AAC ATC TAC CCT CAC GGA ATC 2502 
Phe Lys Asn Gin Ala Ser Arg Pro Tyr Asn lie Tyr Pro His Gly lie 
485 490 495 

ACT GAT GTC CGT CCT TTG TAT TCA AGG AGA TTA CCA AAA GGT GTA AAA 2 550 

Thr Asp Val Arg Pro Leu Tyr Ser Arg Arg Leu Pro Lys Gly Val Lys 
500 505 510 515 

CAT TTG AAG GAT TTT CCA ATT CTG CCA GGA GAA ATA TTC AAA TAT AAA 25 98 

His Leu Lys Asp Phe Pro lie Leu Pro Gly Glu lie Phe Lys Tyr Lys 
520 525 530 

TGG ACA GTG ACT GTA GAA GAT GGG CCA ACT AAA TCA GAT CCT CGG TGC 2 64 6 

Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser Asp Pro Arg Cys 
535 540 545 

CTG ACC CGC TAT TAC TCT AGT TTC GTT AAT ATG GAG AGA GAT CTA GCT -2 694 
Leu Thr Arg Tyr Tyr Ser Ser Phe Val Asn Met Glu Arg Asp Leu Ala 
550 555 560 

TCA GGA CTC ATT GGC CCT CTC CTC ATC TGC TAC AAA GAA TCT GTA GAT 27 4 2 

Ser Gly Leu lie Gly Pro Leu Leu lie Cys Tyr Lys Glu Ser Val Asp 
565 570 575 

CAA AGA GGA AAC CAG ATA ATG TCA GAC AAG AGG AAT GTC ATC CTG TTT 27 90 
Gin Arg Gly Asn Gin lie Met Ser Asp Lys Arg Asn Val lie Leu Phe 
580 585 590 595 

TCT GTA TTT GAT GAG AAC CGA AGC TGG TAC CTC ACA GAG AAT ATA CAA 28 38 
Ser Val Phe Asp Glu Asn Arg Ser Trp Tyr Leu Thr Glu Asn lie Gin 
600 605 610 
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CGC TTT CTC CCC AAT CCA GCT GGA GTG CAG CTT GAG GAT CCA GAG TTC 288 6 

Arg Phe Leu Pro Asn Pro Ala Gly Val Gin Leu Glu Asp Pro Glu Phe 
615 620 625 

CAA GCC TCC AAC ATC ATG CAC AGC ATC AAT GGC TAT GTT TTT GAT AGT 2 934 
Gin Ala Ser Asn lie Met His Ser lie Asn Gly Tyr Val Phe Asp Ser 
630 635 640 

TTG CAG TTG TCA GTT TGT TTG CAT GAG GTG GCA TAC TGG TAC ATT CTA 2 982 

Leu Gin Leu Ser Val Cys Leu His Glu Val Ala Tyr Trp Tyr lie Leu 
645 650 655 

AGC ATT GGA GCA CAG ACT GAC TTC CTT TCT GTC TTC TTC TCT GGA TAT 3030 
Ser lie Gly Ala Gin Thr Asp Phe Leu Ser Val Phe Phe Ser Gly Tyr 
660 665 670 675 

ACC TTC AAA CAC AAA ATG GTC TAT GAA GAC ACA CTC ACC CTA TTC CCA 307 8 

Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu Thr Leu Phe Pro 
680 685 690 

TTC TCA GGA GAA ACT GTC TTC ATG TCG ATG GAA AAC CCA GGT CTA TGG 312 6 

Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn Pro Gly Leu Trp 
695 700 705 

ATT CTG GGG TGC CAC AAC TCA GAC TTT CGG AAC AGA GGC ATG ACC GCC 317 4 

lie Leu Gly Cys His Asn Ser Asp Phe Arg Asn Arg Gly Met Thr Ala 
710 715 720 

TTA CTG AAG GTT TCT AGT TGT GAC AAG AAC ACT GGT GAT TAT TAC GAG 3222 
Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr Gly Asp Tyr Tyr Glu 
725 730 735 

GAC AGT TAT GAA GAT ATT TCA GCA TAC TTG CTG AGT AAA AAC AAT GCC 3270 
Asp Ser Tyr Glu Asp lie Ser Ala Tyr Leu Leu Ser Lys Asn Asn Ala 
740 745 750 755 

ATT GAA CCA AGA AGC TTC TCC CAG AAT TCA AGA CAC CCT AGC ACT AGG 3318 
lie Glu Pro Arg Ser Phe Ser Gin Asn Ser Arg His Pro Ser Thr Arg 
760 765 770 

CAA AAG CAA TTT AAT GCC ACC ACA ATT CCA GAA AAT GAC ATA GAG AAG 33 66 

Gin Lys Gin Phe Asn Ala Thr Thr lie Pro Glu Asn Asp lie Glu Lys 
775 780 ' 785 

ACT GAC CCT TGG TTT GCA CAC AGA ACA CCT ATG CCT AAA ATA CAA AAT 3414 
Thr Asp Pro Trp Phe Ala His Arg Thr Pro Met Pro Lys lie Gin Asn 
790 795 800 

GTC TCC TCT AGT GAT TTG TTG ATG CTC TTG CGA CAG AGT CCT ACT CCA 34 62 

Val Ser Ser Ser Asp Leu Leu Met Leu Leu Arg Gin Ser Pro Thr Pro 
805 810 815 

CAT GGG CTA TCC TTA TCT GAT CTC CAA GAA GCC AAA TAT GAG ACT TTT 3510 
His Gly Leu Ser Leu Ser Asp Leu Gin Glu Ala Lys Tyr Glu Thr Phe 
820 825 830 835 
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TCT GAT GAT CCA TCA CCT GGA GCA ATA GAC AGT AAT AAC AGC CTG TCT 3558 
Ser Asp Asp Pro Ser Pro Gly Ala lie Asp Ser Asn Asn Ser Leu Ser 
840 845 850 

GAA ATG ACA CAC TTC AGG CCA CAG CTC CAT CAC AGT GGG GAC ATG GTA 3606 
Glu Met Thr His Phe Arg Pro Gin Leu His His Ser Gly Asp Met Val 
855 860 865 

TTT ACC CCT GAG TCA GGC CTC CAA TTA AGA TTA AAT GAG AAA CTG GGG 365 4 

Phe Thr Pro Glu Ser Gly Leu Gin Leu Arg Leu Asn Glu Lys Leu Gly 
870 875 880 

ACA ACT GCA GCA ACA GAG TTG AAG AAA CTT GAT TTC AAA GTT TCT AGT 3702 
Thr Thr Ala Ala Thr Glu Leu Lys Lys Leu Asp Phe Lys Val Ser Ser 
885 890 895 

ACA TCA AAT AAT CTG ATT TCA ACA ATT CCA TCA GAC AAT TTG GCA GCA 37 50 

Thr Ser Asn Asn Leu lie Ser Thr lie Pro Ser Asp Asn Leu Ala Ala 
900 905 910 915 

GGT ACT GAT AAT ACA AGT TCC TTA GGA CCC CCA AGT ATG CCA GTT CAT 37 98 

Gly Thr Asp Asn Thr Ser Ser Leu Gly Pro Pro Ser Met Pro Val His 
920 925 930 

TAT GAT AGT CAA TTA GAT ACC ACT CTA TTT GGC AAA AAG TCA TCT CCC 38 4 6 

Tyr Asp Ser Gin Leu Asp Thr Thr Leu Phe Gly Lys Lys Ser Ser Pro 
935 940 945 

CTT ACT GAG TCT GGT GGA CCT CTG AGC TTG AGT GAA GAA AAT AAT GAT 38 94 

Leu Thr Glu Ser Gly Gly Pro Leu Ser Leu Ser Glu Glu Asn Asn Asp 
950 955 960 

TCA AAG TTG TTA GAA TCA GGT TTA ATG AAT AGC CAA GAA AGT TCA TGG 3942 
Ser Lys Leu Leu Glu Ser Gly Leu Met Asn Ser Gin Glu Ser Ser Trp 
965 970 975 

GGA AAA AAT GTA TCG TCA ACA GAG AGT GGT AGG TTA TTT AAA GGG AAA 3990 
Gly Lys Asn Val Ser Ser Thr Glu Ser Gly Arg Leu Phe Lys Gly Lys 
980 985 990 995 

AGA GCT CAT GGA CCT GCT TTG TTG ACT AAA GAT AAT GCC TTA TTC AAA 4 038 

Arg Ala His Gly Pro Ala Leu Leu Thr Lys Asp Asn Ala Leu Phe Lys 
1000 1005 1010 

GTT AGC ATC TCT TTG TTA AAG ACA AAC AAA ACT TCC AAT AAT TCA GCA 4 08 6 

Val Ser lie Ser Leu Leu Lys Thr Asn Lys Thr Ser Asn Asn Ser Ala 
1015 1020 1025 

ACT AAT AGA AAG ACT CAC ATT GAT GGC CCA TCA TTA TTA ATT GAG AAT 4134 
Thr Asn Arg Lys Thr His lie Asp Gly Pro Ser Leu Leu lie Glu Asn 
1030 1035 1040 

AGT CCA TCA GTC TGG CAA AAT ATA TTA GAA AGT GAC ACT GAG TTT AAA 4182 
Ser Pro Ser Val Trp Gin Asn lie Leu Glu Ser Asp Thr Glu Phe Lys 
1045 1050 1055 
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AAA GTG ACA CCT TTG ATT CAT GAC AGA ATG CTT ATG GAC AAA AAT GCT 4 230 

Lys Val Thr Pro Leu lie His Asp Arg Met Leu Met Asp Lys Asn Ala 
1060 1065 1070 1075 

ACA GCT TTG AGG CTA AAT CAT ATG TCA AAT AAA ACT ACT TCA TCA AAA 4 27 8 

Thr Ala Leu Arg Leu Asn His Met Ser Asn Lys Thr Thr Ser Ser Lys 
1080 1085 1090 

AAC ATG GAA ATG GTC CAA CAG AAA AAA GAG GGC CCC ATT CCA CCA GAT 4 326 

Asn Met Glu Met Val Gin Gin Lys Lys Glu Gly Pro lie Pro Pro Asp 
1095 1100 " 1105 

GCA CAA AAT CCA GAT ATG TCG TTC TTT AAG ATG CTA TTC TTG CCA GAA 4.37 4 

Ala Gin Asn Pro Asp Met Ser Phe Phe Lys Met Leu Phe Leu Pro Glu 
1110 1115 1120 

TCA GCA AGG TGG ATA CAA AGG ACT CAT GGA AAG AAC TCT CTG AAC TCT 4 4 22 

Ser Ala Arg Trp lie Gin Arg Thr His Gly Lys Asn Ser Leu Asn Ser 
1125 1130 1135 

GGG CAA GGC CCC AGT CCA AAG CAA TTA GTA TCC TTA GGA CCA GAA AAA 4 4 70 

Gly Gin Gly Pro Ser Pro Lys Gin Leu Val Ser Leu Gly Pro Glu Lys 
1140 1145 1150 1155 

TCT GTG GAA GGT CAG AAT TTC TTG TCT GAG AAA AAC AAA GTG GTA GTA 4 518 

Ser Val Glu Gly Gin Asn Phe Leu Ser Glu Lys Asn Lys Val Val Val 
1160 1165 1170 

GGA AAG GGT GAA TTT ACA AAG GAC GTA GGA CTC AAA GAG ATG GTT TTT 4 566 

Gly Lys Gly Glu Phe Thr Lys Asp Val Gly Leu Lys Glu Met Val Phe 
1175 1180 1185 

CCA AGC AGC AGA AAC CTA TTT CTT ACT AAC TTG GAT AAT TTA CAT GAA 4 614 

Pro Ser Ser Arg Asn Leu Phe Leu Thr Asn Leu Asp Asn Leu His Glu 
1190 1195 1200 

AAT AAT ACA CAC AAT CAA GAA AAA AAA ATT CAG GAA GAA ATA GAA AAG 4 662 

Asn Asn Thr His Asn Gin Glu Lys Lys lie Gin Glu Glu lie Glu Lys 
1205 1210 1215 

AAG GAA ACA TTA ATC CAA GAG AAT GTA GTT TTG CCT CAG ATA CAT ACA 4 710 

Lys Glu Thr Leu He Gin Glu Asn Val Val Leu Pro Gin He His Thr 
1220 1225 1230 1235 

GTG ACT GGC ACT AAG AAT TTC ATG AAG AAC CTT TTC TTA CTG AGC ACT 4 7 58 

Val Thr Gly Thr Lys Asn Phe Met Lys Asn Leu Phe Leu Leu Ser Thr 
1240 1245 1250 

AGG CAA AAT GTA GAA GGT TCA TAT GAG GGG GCA TAT GCT CCA GTA CTT 4 80 6 

Arg Gin Asn Val Glu Gly Ser Tyr Glu Gly Ala Tyr Ala Pro Val Leu 
1255 1260 1265 

CAA GAT TTT AGG TCA TTA AAT GAT TCA ACA AAT AGA ACA AAG AAA CAC 4 8 54 

Gin Asp Phe Arg Ser Leu Asn Asp Ser Thr Asn Arg Thr Lys Lys His 
1270 1275 1280 
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C^^^ AAA AAA GGG GAG GAA GAA AAC^^ 



ACA GCT CAT TTC TCA AAA AAA GGG GAG GAA GAA AAC TTG GAA GGC TTG 4 902 

Thr Ala His Phe Ser Lys Lys Gly Glu Glu Glu Asn Leu Glu Gly Leu 
1285 1290 1295 

GGA AAT* CAA ACC AAG CAA ATT GTA GAG AAA TAT GCA TGC ACC ACA AGG 4 950 

Gly Asn Gin Thr Lys Gin lie Val Glu Lys Tyr Ala Cys Thr Thr Arg 
1300 1305 1310 1315 

ATA TCT CCT AAT ACA AGC CAG CAG AAT TTT GTC ACG CAA CGT AGT AAG 4 998 

lie Ser Pro Asn Thr Ser Gin Gin Asn Phe Val Thr Gin Arg Ser Lys 
1320 1325 1330 

AGA GCT TTG AAA CAA TTC AGA CTC CCA CTA GAA GAA ACA GAA CTT GAA 504 6 

Arg Ala Leu Lys Gin Phe Arg Leu Pro Leu Glu Glu Thr Glu Leu Glu 
1335 1340 1345 

AAA AGG ATA ATT GTG GAT GAC ACC TCA ACC CAG TGG TCC AAA AAC ATG 5094 
Lys Arg lie lie Val Asp Asp Thr Ser Thr Gin Trp Ser Lys Asn Met 
1350 1355 1360 

AAA CAT TTG ACC CCG AGC ACC CTC ACA CAG ATA GAC TAC AAT GAG AAG 514 2 

Lys His Leu Thr Pro Ser Thr Leu Thr Gin lie Asp Tyr Asn Glu Lys 
1365 1370 1375 

GAG AAA GGG GCC ATT ACT CAG TCT CCC TTA TCA GAT TGC CTT ACG AGG 5190 
Glu Lys Gly Ala lie Thr Gin Ser Pro Leu Ser Asp Cys Leu Thr Arg 
1380 1385 1390 1395 

AGT CAT AGC ATC CCT CAA GCA AAT AGA TCT CCA TTA CCC ATT GCA AAG 5238 
Ser His Ser lie Pro Gin Ala Asn Arg Ser Pro Leu Pro lie Ala Lys 
1400 1405 1410 

GTA TCA TCA TTT CCA TCT ATT AGA CCT ATA TAT CTG ACC AGG GTC CTA 528 6 
Val Ser Ser Phe Pro Ser lie Arg Pro lie Tyr Leu Thr Arg Val Leu 
1415 1420 1425 

TTC CAA GAC AAC TCT TCT CAT CTT CCA GCA GCA TCT TAT AGA AAG AAA 5334 
Phe Gin Asp Asn Ser Ser His Leu Pro Ala Ala Ser Tyr Arg Lys Lys 
1430 1435 1440 

GAT TCT GGG GTC CAA GAA AGC AGT CAT TTC TTA CAA GGA GCC AAA AAA 5382 
Asp Ser Gly Val Gin Glu Ser Ser His Phe Leu Gin Gly Ala Lys Lys 
1445 1450 1455 

AAT AAC CTT TCT TTA GCC ATT CTA ACC TTG GAG ATG ACT GGT GAT CAA 54 30 

Asn Asn Leu Ser Leu Ala lie Leu Thr Leu Glu Met Thr Gly Asp Gin 
1460 1465 1470 1475 

AGA GAG GTT GGC TCC CTG GGG ACA AGT GCC ACA AAT TCA GTC ACA TAC 54 7 8 

Arg Glu Val Gly Ser Leu Gly Thr Ser Ala Thr Asn Ser Val Thr Tyr 
1480 1485 1490 

AAG AAA GTT GAG AAC ACT GTT CTC CCG AAA CCA GAC TTG CCC AAA ACA 5526 
Lys Lys Val Glu Asn Thr Val Leu Pro Lys Pro Asp Leu Pro Lys Thr 
1495 1500 1505 
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TCT GGC AAA GTT GAA TTG CTT CCA AAA GTT CAC ATT TAT CAG AAG GAC 557 4 

Ser Gly Lys Val Glu Leu Leu Pro Lys Val His lie Tyr Gin Lys Asp 
1510 1515 1520 

CTA TTC CCT ACG GAA ACT AGC AAT GGG TCT CCT GGC CAT CTG GAT CTC 5 622 
Leu Phe Pro Thr Glu Thr Ser Asn Gly Ser Pro Gly His Leu Asp Leu 
1525 1530 1535 

GTG GAA GGG AGC CTT CTT CAG GGA ACA GAG GGA GCG ATT AAG TGG AAT 5 670 

Val Glu Gly Ser Leu Leu Gin Gly Thr Glu Gly Ala He Lys Trp Asn 
1540 1545 1550 1555 

GAA GCA AAC AGA CCT GGA AAA GTT CCC TTT CTG AGA GTA GCA ACA GAA 5718 
Glu Ala Asn Arg Pro Gly Lys Val Pro Phe Leu Arg Val Ala Thr Glu 
1560 1565 1570 

AGC TCT GCA AAG ACT CCC TCC AAG CTA TTG GAT CCT CTT GCT TGG GAT 57 66 

Ser Ser Ala Lys Thr Pro Ser Lys Leu Leu Asp Pro Leu Ala Trp Asp 
1575 1580 1585 

AAC CAC TAT GGT ACT CAG ATA CCA AAA GAA GAG TGG AAA TCC CAA GAG 5814 
Asn His Tyr Gly Thr Gin He Pro Lys Glu Glu Trp Lys Ser Gin Glu 
1590 1595 1600 

AAG TCA CCA GAA AAA ACA GCT TTT AAG AAA AAG GAT ACC ATT TTG TCC 5 8 62 

Lys Ser Pro Glu Lys Thr Ala Phe Lys Lys Lys Asp Thr He Leu Ser 
1605 1610 1615 

CTG AAC GCT TGT GAA AGC AAT CAT GCA ATA GCA GCA ATA AAT GAG GGA 5 910 

Leu Asn Ala Cys Glu Ser Asn His Ala He Ala Ala He Asn Glu Gly 
1620 1625 1630 1635 

CAA AAT AAG CCC GAA ATA GAA GTC ACC TGG GCA AAG CAA GGT AGG ACT 5 958 

Gin Asn Lys Pro Glu lie Glu Val Thr Trp Ala Lys Gin Gly Arg Thr 
1640 1645 1650 

GAA AGG CTG TGC TCT CAA AAC CCA CCA GTC TTG AAA CGC CAT CAA CGG 600 6 

Glu Arg Leu Cys Ser Gin Asn Pro Pro Val Leu Lys Arg His Gin Arg 
1655 1660 1665 

GAA ATA ACT CGT ACT ACT CTT CAG TCA GAT CAA GAG GAA ATT GAC TAT 6054 
Glu lie Thr Arg Thr Thr Leu Gin Ser Asp Gin Glu Glu He Asp Tyr 
1670 1675 1680 

GAT GAT ACC ATA TCA GTT GAA ATG AAG AAG GAA GAT TTT GAC ATT TAT 6102 
Asp Asp Thr He Ser Val Glu Met Lys Lys Glu Asp Phe Asp lie Tyr 
1685 1690 1695 

GAT GAG GAT GAA AAT CAG AGC CCC CGC AGC TTT CAA AAG AAA ACA CGA 6150 
Asp Glu Asp Glu Asn Gin Ser Pro Arg Ser Phe Gin Lys Lys Thr Arg 
1700 1705 1710 1715 

CAC TAT TTT ATT GCT GCA GTG GAG AGG CTC TGG GAT TAT GGG ATG AGT 6198 
His Tyr Phe lie Ala Ala Val Glu Arg Leu Trp Asp Tyr Gly Met Ser 
1720 1725 1730 
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r^^^ CTA AGA AAC AGG GCT CAG AGT . 



AGC TCC CCA CAT GTT CTA AGA AAC AGG GCT CAG AGT GGC AGT GTC CCT 624 6 

Ser Ser Pro His Val Leu Arg Asn Arg Ala Gin Ser Gly Ser Val Pro 
1735 1740 1745 



CAG TTC AAG AAA GTT GTT TTC CAG GAA TTT ACT GAT GGC TCC TTT ACT 62 94 

Gin Phe Lys Lys Val Val Phe Gin Glu Phe Thr Asp Gly Ser Phe Thr 
1750 1755 1760 



CAG CCC TTA TAC CGT GGA GAA CTA AAT GAA CAT TTG GGA CTC CTG GGG 634 2 

Gin Pro Leu Tyr Arg Gly Glu Leu Asn Glu His Leu Gly Leu Leu Gly 
1765 1770 1775 

CCA TAT ATA AGA GCA GAA GTT GAA GAT AAT ATC ATG GTA ACT TTC AGA 6390 
Pro Tyr lie Arg Ala Glu Val Glu Asp Asn lie Met Val Thr Phe Arg 
1780 1785 1790 1795 

AAT CAG GCC TCT CGT CCC TAT TCC TTC TAT TCT AGC CTT ATT TCT TAT 64 38 

Asn Gin Ala Ser Arg Pro Tyr Ser Phe Tyr Ser Ser Leu lie Ser Tyr 
1800 1805 1810 



GAG GAA GAT CAG AGG CAA GGA GCA GAA CCT AGA AAA AAC TTT GTC AAG 64 8 6 

Glu Glu Asp Gin Arg Gin Gly Ala Glu Pro Arg Lys Asn Phe Val Lys 
1815 1820 1825 



CCT AAT GAA ACC AAA ACT TAC TTT TGG AAA GTG CAA CAT CAT ATG GCA 6534 
Pro Asn Glu Thr Lys Thr Tyr Phe Trp Lys Val Gin His His Met Ala 
1830 1835 1840 



CCC ACT AAA GAT GAG TTT GAC TGC AAA GCC TGG GCT TAT TTC TCT GAT 6582 
Pro Thr Lys Asp Glu Phe Asp Cys Lys Ala Trp Ala Tyr Phe Ser Asp 
1845 1850 1855 

GTT GAC CTG GAA AAA GAT GTG CAC TCA GGC CTG ATT GGA CCC CTT CTG 6630 
Val Asp Leu Glu Lys Asp Val His Ser Gly Leu lie Gly Pro Leu Leu 
1860 1865 1870 1875 

GTC TGC CAC ACT AAC ACA CTG AAC CCT GCT CAT GGG AGA CAA GTG ACA 6 67 8 

Val Cys His Thr Asn Thr Leu Asn Pro Ala His Gly Arg Gin Val Thr 
1880 1885 1890 



GTA CAG GAA TTT GCT CTG TTT TTC ACC ATC TTT GAT GAG ACC AAA AGC 67 2 6 

Val Gin Glu Phe Ala Leu Phe Phe Thr lie Phe Asp Glu Thr Lys Ser 
1895 1900 1905 

TGG TAC TTC ACT GAA AAT ATG GAA AGA AAC TGC AGG GCT CCC TGC AAT 67 7 4 

Trp Tyr Phe Thr Glu Asn Met Glu Arg Asn Cys Arg Ala Pro Cys Asn 
1910 1915 1920 



ATC CAG ATG GAA GAT CCC ACT TTT AAA GAG AAT TAT CGC TTC CAT GCA 6822 

lie Gin Met Glu Asp Pro Thr Phe Lys Glu Asn Tyr Arg Phe His Ala 
1925 1930 1935 

ATC AAT GGC TAC ATA ATG GAT ACA CTA CCT GGC TTA GTA ATG GCT CAG 687 0 

lie Asn Gly Tyr lie Met Asp Thr Leu Pro Gly Leu Val Met Ala Gin 

1940 1945 1950 1955 
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GAT CAA AGG ATT CGA TGG TAT CTG CTC AGC ATG GGC AGC AAT GAA AAC 6918 
Asp Gin Arg lie Arg Trp Tyr Leu Leu Ser Met Gly Ser Asn Glu Asn 
1960 1965 1970 

ATC CAT TCT ATT CAT TTC AGT GGA CAT GTG TTC ACT GTA CGA AAA AAA 6966 
lie His Ser lie His Phe Ser Gly His Val Phe Thr Val Arg Lys Lys 
1975 1980 1985 

GAG GAG TAT AAA ATG GCA CTG TAC AAT CTC TAT CCA GGT GTT TTT GAG 7014 
Glu Glu Tyr Lys Met Ala Leu Tyr Asn Leu Tyr Pro Gly Val Phe Glu 
1990 1995 2000 

ACA GTG GAA ATG TTA CCA TCC AAA GCT GGA ATT TGG CGG GTG GAA TGC 70 62 

Thr Val Glu Met Leu Pro Ser Lys Ala Gly lie Trp Arg Val Glu Cys 
2005 2010 2015 

CTT ATT GGC GAG CAT CTA CAT GCT GGG ATG AGC ACA CTT TTT CTG GTG 7110 
Leu lie Gly Glu His Leu His Ala Gly Met Ser Thr Leu Phe Leu Val 
2020 2025 2030 2035 

TAC AGC AAT AAG TGT CAG ACT CCC CTG GGA ATG GCT TCT GGA CAC ATT 7158 
Tyr Ser Asn Lys Cys Gin Thr Pro Leu Gly Met Ala Ser Gly His lie 
2040 2045 2050 

AGA GAT TTT CAG ATT ACA GCT TCA GGA CAA TAT GGA CAG TGG GCC CCA 7206 
Arg Asp Phe Gin lie Thr Ala Ser Gly Gin Tyr Gly Gin Trp Ala Pro 
2055 2060 2065 

AAG CTG GCC AGA CTT CAT TAT TCC GGA TCA ATC AAT GCC TGG AGC ACC 72 54 

Lys Leu Ala Arg Leu His Tyr Ser Gly Ser lie Asn Ala Trp Ser Thr 
2070 2075 2080 

AAG GAG CCC TTT TCT TGG ATC AAG GTG GAT CTG TTG GCA CCA ATG ATT 7 302 

Lys Glu Pro Phe Ser Trp lie Lys Val Asp Leu Leu Ala Pro Met lie 
2085 2090 2095 

ATT CAC GGC ATC AAG ACC CAG GGT GCC CGT CAG AAG TTC TCC AGC CTC 7 350 

lie His Gly lie Lys Thr Gin Gly Ala Arg Gin Lys Phe Ser Ser Leu 
2100 2105 2110 2115 

TAC ATC TCT CAG TTT ATC ATC ATG TAT AGT CTT GAT GGG AAG AAG TGG 7 398 

Tyr lie Ser Gin Phe lie lie Met Tyr Ser Leu Asp Gly Lys Lys Trp 
2120 2125 2130 

CAG ACT TAT CGA GGA AAT TCC ACT GGA ACC TTA ATG GTC TTC TTT GGC 74 46 

Gin Thr Tyr Arg Gly Asn Ser Thr Gly Thr Leu Met Val Phe Phe Gly 
2135 2140 2145 

AAT GTG GAT TCA TCT GGG ATA AAA CAC AAT ATT TTT AAC CCT CCA ATT 74 94 

Asn Val Asp Ser Ser Gly lie Lys His Asn lie Phe Asn Pro Pro lie 
2150 2155 2160 

ATT GCT CGA TAC ATC CGT TTG CAC CCA ACT CAT TAT AGC ATT CGC AGC 7 54 2 

lie Ala Arg Tyr lie Arg Leu His Pro Thr His Tyr Ser lie Arg Ser 
2165 • -2170 2175 
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ACT CTT CGC ATG GAG TTG ATG GGC TGT GAT TTA AAT AGT TGC AGC ATG 7 5 90 

Thr Leu Arg Met Glu Leu Met Gly Cys Asp Leu Asn Ser Cys Ser Met 
2180 2185 2190 2195 

CCA TTG GGA ATG GAG AGT AAA GCA ATA TCA GAT GCA CAG ATT ACT GCT 7 638 

Pro Leu Gly Met Glu Ser Lys Ala lie Ser Asp Ala Gin lie Thr Ala 
2200 2205 2210 

TCA TCC TAC TTT ACC AAT ATG TTT GCC ACC TGG TCT CCT TCA AAA GCT 7 68 6 

Ser Ser Tyr Phe Thr Asn Met Phe Ala Thr Trp Ser Pro Ser Lys Ala 
2215 2220 " 2225 

CGA CTT CAC CTC CAA GGG AGG AGT AAT GCC TGG AGA CCT CAG GTG AAT 7 7 34 

Arg Leu His Leu Gin Gly Arg Ser Asn Ala Trp Arg Pro Gin Val Asn 
2230 2235 2240 

AAT CCA AAA GAG TGG CTG CAA GTG GAC TTC CAG AAG ACA ATG AAA GTC 7 7 82 

Asn Pro Lys Glu Trp Leu Gin Val Asp Phe Gin Lys Thr Met Lys Val 
2245 2250 2255 

ACA GGA GTA ACT ACT CAG GGA GTA AAA TCT CTG CTT ACC AGC ATG TAT 7 8 30 

Thr Gly Val Thr Thr Gin Gly Val Lys Ser Leu Leu Thr Ser Met Tyr 
2260 2265 2270 2275 

GTG AAG GAG TTC CTC ATC TCC AGC AGT CAA GAT GGC CAT CAG TGG ACT 7 8 78 

Val Lys Glu Phe Leu lie Ser Ser Ser Gin Asp Gly His Gin Trp Thr 
2280 2285 2290 

CTC TTT TTT CAG AAT GGC AAA GTA AAG GTT TTT CAG GGA AAT CAA GAC 7 92 6 

Leu Phe Phe Gin Asn Gly Lys Val Lys Val Phe Gin Gly Asn Gin Asp 
2295 2300 2305 

TCC TTC ACA CCT GTG GTG AAC TCT CTA GAC CCA CCG TTA CTG ACT CGC 7 97 4 

Ser Phe Thr Pro Val Val Asn Ser Leu Asp Pro Pro Leu Leu Thr Arg 
2310 2315 2320 

TAC CTT CGA ATT CAC CCC CAG AGT TGG GTG CAC CAG ATT GCC CTG AGG 8022 
Tyr Leu Arg lie His Pro Gin Ser Trp Val His Gin lie Ala Leu Arg 
2325 2330 2335 

ATG GAG GTT CTG GGC TGC GAG GCA CAG GAC CTC TAC TGAGGGTGGC 80 68 
Met Glu Val Leu Gly Cys Glu Ala Gin Asp Leu Tyr 
2340 2345 2350 

CAC TGC AGC A CCTGCCACTG CCGTCACCTC TCCCTCCTCA GCTCCAGGGC AGTGTCCCTC 8128 

CCTGGCTTGC CTTCTACCTT TGTGCTAAAT CCTAGCAGAC ACTGCCTTGA AGCCTCCTGA 8188 

ATTAACTATC ATCAGTCCTG CATTTCTTTG GTGGGGGGCC AGGAGGGTGC ATCCAATTTA 82 4 8 

ACTTAACTCT TACCTATTTT CTGCAGCTGC TCCCAGATTA CTCCTTCCTT CCAATATAAC 8 308 

TAGGCAAAAA GAAGTGAGGA GAAACCTGCA TGAAAGCATT CTTCCCTGAA AAGTTAGGCC 8 368 

TCTCAGAGTC ACCACTTCCT CTGTTGTAGA AAAACTATGT GATGAAACTT TGAAAAAGAT 8 4 28 

ATTTATGATG TTAACTTGTT TATTGCAGCT TATAATGGTT ACAAATAAAG CAATAGCATC 84 88 
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ACAAATTTCA CAAATAAAGC ATTTTTTTCA CTGCATTCTA GTTGTGGTTT GTCCAAACTC 854 8 

ATCAATGTAT CTTATCATGT CTGGATCCCC GGGTGGCATC CCTGTGACCC CTCCCCAGTG 8 608 

CCTCTCCTGG CCCTGGAAGT TGCCACTCCA GTGCCCACCA GCCTTGTCCT AATAAAATTA 8 668 

AGTTGCATCA TTTTGTCTGA CTAGGTGTCC TTCTATAATA TTATGGGGTG GAGGGGGGTG 8728 

GTATGGAGCA AGGGGCAAGT TGGGAAGACA ACCTGTAGGG CCTGCGGGGT CTATTCGGGA 8788 

ACCAAGCTGG AGTGCAGTGG CACAATCTTG GCTCACTGCA ATCTCCGCCT CCTGGGTTCA 8 84 8 

AGCGATTCTC CTGCCTCAGC CTCCCGAGTT GTTGGGATTC CAGGCATGCA TGACCAGGCT 8 908 

CAGCTAATTT TTGTTTTTTT GGTAGAGACG GGGTTTCACC ATATTGGCCA GGCTGGTCTC 8 968 

CAACTCCTAA TCTCAGGTGA TCTACCCACC TTGGCCTCCC AAATTGCTGG GATTACAGGC 9028 

GTGAACCACT GCTCCCTTCC CTGTCCTTCT GATTTTAAAA TAACTATACC AGCAGGAGGA 908 8 

CGTCCAGACA CAGCATAGGC TACCTGCCAT GCCCAACCGG TGGGACATTT GAGTTGCTTG 914 8 

CTTGGCACTG TCCTCTCATG CGTTGGGTCC ACTCAGTAGA TGCCTGTTGA ATTCGTAATC 9208 

ATGGTCATAG CTGTTTCCTG TGTGAAATTG TTATCCGCTC ACAATTCCAC ACAACATACG 92 68 

AGCCGGAAGC ATAAAGTGTA AAGCCTGGGG TGCCTAATGA GTGAGCTAAC TCACATTAAT 9328 

TGCGTTGCGC TCACTGCCCG CTTTCCAGTC GGGAAACCTG TCGTGCCAGC TGCATTAATG 938 8 

AATCGGCCAA CGCGCGGGGA GAGGCGGTTT GCGTATTGGG CGCTCTTCCG CTTCCTCGCT 94 4 8 

CACTGACTCG CTGCGCTCGG TCGTTCGGCT GCGGCGAGCG GTATCAGCTC ACTCAAAGGC 9508 

GGTAATACGG TTATCCACAG AATCAGGGGA TAACGCAGGA AAGAACATGT GAGCAAAAGG 95 68 

CCAGCAAAAG GCCAGGAACC GTAAAAAGGC CGCGTTGCTG GCGTTTTTCC ATAGGCTCCG 9628 

CCCCCCTGAC GAGCATCACA AAAATCGACG CTCAAGTCAG AGGTGGCGAA ACCCGACAGG 9 68 8 

ACTATAAAGA TACCAGGCGT TTCCCCCTGG AAGCTCCCTC GTGCGCTCTC CTGTTCCGAC 97 4 8 

CCTGCCGCTT ACCGGATACC TGTCCGCCTT TCTCCCTTCG GGAAGCGTGG CGCTTTCTCA 9808 

TAGCTCACGC TGTAGGTATC TCAGTTCGGT GTAGGTCGTT CGCTCCAAGC TGGGCTGTGT 98 68 

GCACGAACCC CCCGTTCAGC CCGACCGCTG CGCCTTATCG GGTAACTATC GTCTTGAGTC 9928 

CAACCCGGTA AGACACGACT TATCGCCACT GGCAGCAGCC ACTGGTAACA GGATTAGCAG 9988 

AGCGAGGTAT GTAGGCGGTG CTACAGAGTT CTTGAAGTGG TGGCCTAACT ACGGCTACAC 1004 8 

TAGAAGGACA GTATTTGGTA TCTGCGCTCT GCTGAAGCCA GTTACCTTCG GAAAAAGAGT 10108 

TGGTAGCTCT TGATCCGGCA AACAAACCAC CGCTGGTAGC GGTGGTTTTT TTGTTTGCAA 10168 

GCAGCAGATT ACGCGCAGAA AAAAAGGATC TCAAGAAGAT CCTTTGATCT TTTCTACGGG 10228 
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GTCTGACGCT CAGTi 




XTG AAAACTCACG TTAAGGGATT TTGGTCATGA GATTATCAAA 10288 




AAGGATCTTC ACCTAGATCC TTTTAAATTA AAAATGAAGT TTTAAATCAA TCTAAAGTAT 1034 8 
ATATGAGTAA ACTTGGTCTG ACAGTTACCA ATGCTTAATC AGTGAGGCAC CTATCTCAGC 104 08 
GATCTGTCTA TTTCGTTCAT CCATAGTTGC CTGACTCCCC GTCGTGTAGA TAACTACGAT 104 68 
ACGGGAGGGC TTACCATCTG GCCCCAGTGC TGCAATGATA CCGCGAGACC CACGCTCACC 10528 
GGCTCCAGAT TTATCAGCAA TAAACCAGCC AGCCGGAAGG GCCGAGCGCA GAAGTGGTCC 10588 
TGCAACTTTA TCCGCCTCCA TCCAGTCTAT TAATTGTTGC CGGGAAGCTA GAGTAAGTAG 1064 8 
TTCGCCAGTT AATAGTTTGC GCAACGTTGT TGCCATTGCT ACAGGCATCG TGGTGTCACG 10708 
CTCGTCGTTT GGTATGGCTT CATTCAGCTC CGGTTCCCAA CGATCAAGGC GAGTTACATG 10768 
ATCCCCCATG TTGTGCAAAA AAGCGGTTAG CTCCTTCGGT CCTCCGATCG TTGTCAGAAG 10828 
TAAGTTGGCC GCAGTGTTAT CACTCATGGT TATGGCAGCA CTGCATAATT CTCTTACTGT 10888 
CATGCCATCC GTAAGATGCT TTTCTGTGAC TGGTGAGTAC TCAACCAAGT CATTCTGAGA 1094 8 
ATAGTGTATG CGGCGACCGA GTTGCTCTTG CCCGGCGTCA ATACGGGATA ATACCGCGCC 11008 
ACATAGCAGA ACTTTAAAAG TGCTCATCAT TGGAAAACGT TCTTCGGGGC GAAAACTCTC 110 68 
AAGGATCTTA CCGCTGTTGA GATCCAGTTC GATGTAACCC ACTCGTGCAC CCAACTGATC 11128 
TTCAGCATCT TTTACTTTCA CCAGCGTTTC TGGGTGAGCA AAAACAGGAA GGCAAAATGC 11188 
CGCAAAAAAG GGAATAAGGG C G AC AC G G AA ATGTTGAATA CTCATACTCT TCCTTTTTCA 112 4 8 
ATATTATTGA AGCATTTATC AGGGTTATTG TCTCATGAGC GGATACATAT TTGAATGTAT 11308 
T T AG AAAAAT AAACAAATAG GGGTTCCGCG CACATTTCCC CGAAAAGTGC CACCTGACGT 11368 
CTAAGAAACC ATTATTATCA TGACATTAAC CTAT AAAAAT AGGCGTATCA CGAGGCCCTT 114 28 
TCGTCTCGCG CGTTTCGGTG ATGACGGTGA AAACCTCTGA CACATGCAGC TCCCGGAGAC 114 8 8 
GGTCACAGCT TGTCTGTAAG CGGATGCCGG GAG C AG AC AA GCCCGTCAGG GCGCGTCAGC 11548 
GGGTGTTGGC GGGTGTCGGG GCTGGCTTAA CTATGCGGCA TCAGAGCAGA TTGTACTGAG 11608 
AGTGCACCAT ATGCGGTGTG AAATACCGCA CAGATGCGTA AG G AG AAAAT ACCGCATCAG 11668 
GCGCCATTCG CCATTCAGGC TGCGCAACTG TTGGGAAGGG CGATCGGTGC GGGCCTCTTC 11728 
GCTATTACGC CAGCTGGCGA AAGGGGGATG TGCTGCAAGG CGATTAAGTT GGGTAACGCC 117 8 8 
AGGGTTTTCC CAGTCACGAC GTTGTAAAAC GACGGCCAGT GCCAAGCTTG GGCTGCAG 118 4 6 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 211 base pairs 
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rYPE: 




(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATTGAACCAA GAAGCTTCTC CCAGGTAAGT TGCTAATAAA GCTTGGCAAG AGTATTTCAA 60 

GGAAGATGAA GTCATTAACT ATGCAAAATG CTTCTCAGGC ACCTAGGAAA ATGAGGATGT 120 

GAGGCATTTC TACCCACTTG GTACATAAAA TTATTGCTTT TCCTCTTCTT TTTTTCTCCA 180 

GAACCCACCA GTCTTGAAAC GCCATCAACG G 211 
(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
GTTGGTATCC TTTTTACAGC ACAACTTAAT GAGACAGATA GAAACTGGTC TTGTAGAAAC 60 
AGAGTAGTCG CCTGCTTTTC TGCCAGGTGC TGACTTCTCT CCCCTGGGCT GTTTTCATTT 12 0 
TCTCAG 12 6 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

GTAAGTATCC TTTTTACAGC ACAACTTAAT GAGACAGATA GAAACTGGTC TTGTAGAAAC 60 

AGAGTAGTCG CCTGCTTTTC TGCCAGGTGC TGACTTCTCT CCCCTTCTCT TTTTTCCTTT 120 

TCTCAG 12 6 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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4 




(D) TO^LOGY: 




linear 



(ii) MOLECULE TYPE: 



cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



GCCACCAUGG 



10 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
AGGTTAATTT T T AAAAAGC A GTCAAAAGTC CAAGTGGCCC TTGCGAGCAT TTACTCTCTC 60 
TGTTTGCTCT GGTTAATAAT CTCAGGAGCA CAAACATTCC 100 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 223 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CTTTCTCTTT TCTTTTACAT GAAGGGTCTG GCAGCCAAAG CAATCACTCA AAGTTCAAAC 60 
CTTATCATTT TTTGCTTTGT TCCTCTTGGC CTTGGTTTTG TACATCAGCT TTGAAAATAC 120 
CATCCCAGGG TTAATGCTGG GGTTAATTTA TAACTAAGAG TGCTCTAGTT TTGCAATACA 18 0 
GG AC AT GCT A TAAAAATGGA AAGATGTTGC TTTCTGAGAG ATA 223 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
AGAUCUCGAG AAAGCUAACA ACAAAGAACA ACAAACAACA AUCAGGAUAA CAAGAACGAA 60 
ACAAUAACAG CCACCAUGGA AAUAGAGCUC 90 
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